Sumit Neelam, Udit Sharma, et al.
EMNLP 2022
Wikification of large corpora is beneficial for various NLP applications. Existing methods focus on output quality rather than run-time, and are therefore infeasible for large data. Here, we introduce RedW, a run-time oriented Wikification solution based on Wikipedia redirects that can Wikify massive corpora with competitive performance. We further propose an efficient method for estimating RedW's confidence, opening the door to applying more demanding methods only to RedW's lower-confidence results. Our experimental results support the validity of the proposed approach.
Guilherme Augusto Ferreira Lima, Alexandre Rademaker, et al.
Science of Computer Programming
Scott McCarley, Mihaela Bornea, et al.
AAAI 2023
Thomas Bohnstingl, Ayush Garg, et al.
ICASSP 2022