Domain adaptation of information extraction models

Gupta, Rahul ; Sarawagi, Sunita (2009) Domain adaptation of information extraction models ACM SIGMOD Record, 37 (4). pp. 35-40. ISSN 0163-5808

[img] PDF
337kB

Official URL: http://doi.org/10.1145/1519103.1519109

Related URL: http://dx.doi.org/10.1145/1519103.1519109

Abstract

Domain adaptation refers to the process of adapting an extraction model trained in one domain to another related domain with only unlabeled data. We present a brief survey of existing methods of retraining models to best exploit labeled data from a related domain. These approaches that involve expensive model retraining are not practical when a large number of new domains have to be handled in an operational setting. We describe our approach for adapting record extraction models that exploits the regularity within a domain to jointly label records without retraining any model.

Item Type:Article
Source:Copyright of this article belongs to ACM, Inc
ID Code:128386
Deposited On:20 Oct 2022 04:03
Last Modified:20 Oct 2022 04:03

Repository Staff Only: item control page