PaddlePaddle/uie-m-large

Information extraction suffers from its varying targets, heterogeneous structures, and demand-specific schemas. The unified text-to-structure generation framework, namely UIE, can universally model different IE tasks, adaptively generate targeted structures, and collaboratively learn general IE abilities from different knowledge sources. Specifically, UIE uniformly encodes different extraction structures via a structured extraction language, adaptively generates target extractions via a schema-based prompt mechanism - structural schema instructor, and captures the common IE abilities via a large-scale pre-trained text-to-structure model. Experiments show that UIE achieved the state-of-the-art performance on 4 IE tasks, 13 datasets, and on all supervised, low-resource, and few-shot settings for a wide range of entity, relation, event and sentiment extraction tasks and their unification. These results verified the effectiveness, universality, and transferability of UIE.

UIE Paper: https://arxiv.org/abs/2203.12277

PaddleNLP released UIE model series for Information Extraction of texts and multi-modal documents which use the ERNIE 3.0 models as the pre-trained language models and were finetuned on a large amount of information extraction data.

Available Models

Model Name	Usage Scenarios	Supporting Tasks
`uie-base` `uie-medium` `uie-mini` `uie-micro` `uie-nano`	For plain text The extractive model of the scene supports Chinese	Supports entity, relation, event, opinion extraction
`uie-base-en`	An extractive model for plain text scenarios, supports English	Supports entity, relation, event, opinion extraction
`uie-m-base` `uie-m-large`	An extractive model for plain text scenarios, supporting Chinese and English	Supports entity, relation, event, opinion extraction
`uie-x-base`	An extractive model for plain text and document scenarios, supports Chinese and English	Supports entity, relation, event, opinion extraction on both plain text and documents/pictures/tables

Performance on Text Dataset

We conducted experiments on the in-house test sets of the three different domains of Internet, medical care, and finance:

	finance		healthcare		internet
	0-shot	5-shot	0-shot	5-shot	0-shot	5-shot
uie-base (12L768H)	46.43	70.92	71.83	85.72	78.33	81.86
uie-medium (6L768H)	41.11	64.53	65.40	75.72	78.32	79.68
uie-mini (6L384H)	37.04	64.65	60.50	78.36	72.09	76.38
uie-micro (4L384H)	37.53	62.11	57.04	75.92	66.00	70.22
uie-nano (4L312H)	38.94	66.83	48.29	76.74	62.86	72.35
uie-m-large (24L1024H)	49.35	74.55	70.50	92.66	78.49	83.02
uie-m-base (12L768H)	38.46	74.31	63.37	87.32	76.27	80.13
🧾🎓uie-x-base (12L768H)	48.84	73.87	65.60	88.81	79.36	81.65

0-shot means that no training data is directly used for prediction through paddlenlp.Taskflow, and 5-shot means that each category contains 5 pieces of labeled data for model fine-tuning. Experiments show that UIE can further improve the performance with a small amount of data (few-shot).

Performance on Multimodal Datasets**

We experimented on the zero-shot performance of UIE-X on the in-house multi-modal test sets in three different domains of general, financial, and medical:

	General	Financial	Medical
🧾🎓uie-x-base (12L768H)	65.03	73.51	84.24

The general test set contains complex samples from different fields and is the most difficult task.

Detailed Info: https://github.com/PaddlePaddle/PaddleNLP/blob/develop/applications/information_extraction/README_en.md