Unsupervised WSD by finding the predominant sense using context as a dynamic thesaurus

Tejada Cárcamo, Javier; Calvo, Hiram; Gelbukh, Alex; Hara, Kazuo

dc.contributor.author	Tejada Cárcamo, Javier
dc.contributor.author	Calvo, Hiram
dc.contributor.author	Gelbukh, Alex
dc.contributor.author	Hara, Kazuo
dc.date.accessioned	2019-01-29T22:19:56Z
dc.date.available	2019-01-29T22:19:56Z
dc.date.issued	2010
dc.description.abstract	We present and analyze an unsupervised method for Word Sense Disambiguation (WSD). Our work is based on the method presented by McCarthy et al. in 2004 for finding the predominant sense of each word in an entire corpus. Their maximization algorithm lets weighted terms (similar words) from a distributional thesaurus accumulate a score for each sense of an ambiguous word; that is, the sense with the highest score is chosen based on votes from a weighted list of terms related to the ambiguous word. This list is obtained with the distributional similarity method proposed by Dekang Lin for building a thesaurus. In the method of McCarthy et al., every occurrence of the ambiguous word uses the same thesaurus, regardless of the context in which the word occurs. Our method takes the context into account when determining the sense of an ambiguous word: it builds the list of distributionally similar words from the syntactic context of the ambiguous word. We obtain a top precision of 77.54%, versus 67.10% for the original method, tested on SemCor. We also analyze the effect of the number of weighted terms on the tasks of finding the Most Frequent Sense (MFS) and WSD, and experiment with several corpora for building the Word Space Model. © 2010 Springer Science+Business Media, LLC & Science Press, China.	es_PE
dc.description.uri	Academic work	es_PE
dc.identifier.doi	https://doi.org/10.1007/s11390-010-9385-2	es_PE
dc.identifier.issn	10009000	es_PE
dc.identifier.uri	https://hdl.handle.net/20.500.12590/15898
dc.language.iso	eng	es_PE
dc.publisher	Scopus	es_PE
dc.relation.uri	https://www.scopus.com/inward/record.uri?eid=2-s2.0-78650204798&doi=10.1007%2fs11390-010-9385-2&partnerID=40&md5=99192aac46b7a3081deac681de4bf27b	es_PE
dc.rights	info:eu-repo/semantics/restrictedAccess	es_PE
dc.source	Repositorio Institucional - UCSP	es_PE
dc.source	Universidad Católica San Pablo	es_PE
dc.source	Scopus	es_PE
dc.subject	Distributional similarities	es_PE
dc.subject	Maximization algorithm	es_PE
dc.subject	Semantic similarity	es_PE
dc.subject	Text corpora	es_PE
dc.subject	Unsupervised method	es_PE
dc.subject	Word sense	es_PE
dc.subject	Word Sense Disambiguation	es_PE
dc.subject	Word spaces	es_PE
dc.subject	Semantics	es_PE
dc.subject	Software agents	es_PE
dc.subject	Thesauri	es_PE
dc.subject	Natural language processing systems	es_PE
dc.subject.ocde	https://purl.org/pe-repo/ocde/ford#1.02.00	es_PE
dc.title	Unsupervised WSD by finding the predominant sense using context as a dynamic thesaurus	es_PE
dc.type	info:eu-repo/semantics/article

Collections

Articles - Computer Science
