shengz commited on
Commit
d889b40
1 Parent(s): 7adf48e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -15,6 +15,8 @@ Entity linking faces significant challenges such as prolific variations and prev
15
  This KRISSBERT is initialized with [PubMedBERT](https://huggingface.co/microsoft/BiomedNLP-PubMedBERT-base-uncased-abstract) parameters, and then trained using self-supervised examples that are generated by combining [PubMed](https://pubmed.ncbi.nlm.nih.gov/) abstracts and the [UMLS](https://www.nlm.nih.gov/research/umls/index.html) ontology. Experiments on seven standard biomedical entity linking datasets show that KRISSBERT attains new state of the art, outperforming prior self-supervised methods by as much as 20 absolute points in accuracy.
16
  See [Zhang et al., 2021](https://arxiv.org/abs/2112.07887) for the details.
17
 
 
 
18
  ## Citation
19
 
20
  If you find KRISSBERT useful in your research, please cite the following paper:
 
15
  This KRISSBERT is initialized with [PubMedBERT](https://huggingface.co/microsoft/BiomedNLP-PubMedBERT-base-uncased-abstract) parameters, and then trained using self-supervised examples that are generated by combining [PubMed](https://pubmed.ncbi.nlm.nih.gov/) abstracts and the [UMLS](https://www.nlm.nih.gov/research/umls/index.html) ontology. Experiments on seven standard biomedical entity linking datasets show that KRISSBERT attains new state of the art, outperforming prior self-supervised methods by as much as 20 absolute points in accuracy.
16
  See [Zhang et al., 2021](https://arxiv.org/abs/2112.07887) for the details.
17
 
18
+ Note that some prior work like [BioSyn](https://aclanthology.org/2020.acl-main.335.pdf), [SapBERT](https://aclanthology.org/2021.naacl-main.334.pdf), and their follow-up ([Lai et al., 2021](https://aclanthology.org/2021.findings-emnlp.140.pdf)) claimed to do entity linking, but their systems completely ignore the context of an entity mention, and can only predict a surface form, _**not CUI**_ (See Figure 1 in [BioSyn](https://aclanthology.org/2020.acl-main.335.pdf)). Therefore, they can't disambiguate ambiguous mentions. For instance, given the entity mention "_ER_" in the sentence "*ER crowding has become a wide-spread problem*", their systems predict the nearest entity name (which is also "ER") in the ontology. They can't pinpoint the target entity "*Emergency Room (C0562508)*", because other entities such as "*Estrogen Receptor Gene (C1414461)*" and "*Endoplasmic Reticulum(C0014239)*" also use "ER" as their alias. Without using the context information, their systems can't resolve such ambiguity. Unfortunately, their evaluation considers it a correct prediction, for the reason that "ER" matches one of the aliases of the gold entity, which is problematic. Consequently, the reported results in their papers do not reflect true performance on entity linking.
19
+
20
  ## Citation
21
 
22
  If you find KRISSBERT useful in your research, please cite the following paper: