Registro completo de metadatos
Campo DC Valor Lengua/Idioma
dc.provenanceComisión de Investigaciones Científicas-
dc.contributorFerretti, Edgardo-
dc.contributorSoria, Matías-
dc.contributorPérez Casseignau, Sebastián-
dc.contributorPohn, Lian-
dc.contributorUrquiza, Guido-
dc.contributorGómez, Sergio Alejandro-
dc.contributorErrecalde, Marcelo-
dc.creatorFerretti, Edgardo-
dc.creatorSoria, Matías-
dc.creatorPérez Casseignau, Sebastián-
dc.creatorPohn, Lian-
dc.creatorUrquiza, Guido-
dc.creatorGómez, Sergio Alejandro-
dc.creatorErrecalde, Marcelo-
dc.date2017-04-
dc.date.accessioned2019-04-29T16:13:24Z-
dc.date.available2019-04-29T16:13:24Z-
dc.date.issued2017-04-
dc.identifierhttp://digital.cic.gba.gob.ar/handle/11746/5668-
dc.identifierRecurso completo-
dc.identifier.urihttp://rodna.bn.gov.ar:8080/jspui/handle/bnmm/311959-
dc.descriptionFeatured Articles (FA) are considered to be the best articles that Wikipedia has to offer and in the last years, researchers have found interesting to analyze whether and how they can be distinguished from “ordinary” articles. Likewise, identifying what issues have to be enhanced or fixed in ordinary articles in order to improve their quality is a recent key research trend. Most of the approaches developed to face these information quality problems have been proposed for the English Wikipedia. However, few efforts have been accomplished in Spanish Wikipedia, despite being Spanish, one of the most spoken languages in the world by native speakers. In this respect, we present a breakdown of Spanish Wikipedia’s quality flaw structure. Besides, we carry out studies with three different corpora to automatically assess information quality in Spanish Wikipedia, where FA identification is evaluated as a binary classification task. Our evaluation on a unified setting allows to compare with the English version, the performance achieved by our approach on the Spanish version. The best results obtained show that FA identification in Spanish, can be performed with an F1 score of 0.88 using a document model consisting of only twenty six features and Support Vector Machine as classification algorithm.-
dc.formatapplication/pdf-
dc.formatp. 29-36-
dc.languageeng-
dc.rightsinfo:eu-repo/semantics/openAccess-
dc.rightsAttribution 4.0 International (BY 4.0)-
dc.sourcereponame:CIC Digital (CICBA)-
dc.sourceinstname:Comisión de Investigaciones Científicas de la Provincia de Buenos Aires-
dc.sourceinstacron:CICBA-
dc.source.urihttp://digital.cic.gba.gob.ar/handle/11746/5668-
dc.source.uriRecurso completo-
dc.subjectCiencias de la Computación e Información-
dc.titleTowards Information Quality Assurance in Spanish: Wikipedia-
dc.typeinfo:eu-repo/semantics/article-
dc.typeinfo:eu-repo/semantics/publishedVersion-
dc.typeinfo:ar-repo/semantics/articulo-
Aparece en las colecciones: Comisión de Investigaciones Científicas de la Prov. de Buenos Aires

Ficheros en este ítem:
No hay ficheros asociados a este ítem.