Registro completo de metadatos
Campo DC Valor Lengua/Idioma
dc.creatorHernández Lahme, Damián Gabriel-
dc.date2018-08-10T17:43:35Z-
dc.date2018-08-10T17:43:35Z-
dc.date2015-06-
dc.date2018-08-08T14:10:46Z-
dc.date.accessioned2019-04-29T15:52:44Z-
dc.date.available2019-04-29T15:52:44Z-
dc.date.issued2015-06-
dc.identifierHernández Lahme, Damián Gabriel; Information Approach to Co-occurrence of Words in Written Language; Complex Systems Publications; Complex systems; 24; 2; 6-2015; 1-21-
dc.identifier0891-2513-
dc.identifierhttp://hdl.handle.net/11336/54970-
dc.identifierCONICET Digital-
dc.identifierCONICET-
dc.identifier.urihttp://rodna.bn.gov.ar:8080/jspui/handle/bnmm/304428-
dc.descriptionIn this paper we study the distribution of words across the different parts of a book using tools from information theory. In particular, the mutual information between words in the text and parts of the text is compared with the mutual information of a shuffled version of the book. This analysis allows us to extract not only relevant words of the text but also relationships between the different words, such as co-occurrence and repulsion between them. With the connections due to co-occurrence of words, we show how to construct a network that reflects the semantic organization of the book. This method can be applied to other types of sequences, measuring the relations between the different symbols that compose such sequences.-
dc.descriptionFil: Hernández Lahme, Damián Gabriel. Comisión Nacional de Energía Atómica. Gerencia del Área de Energía Nuclear. Instituto Balseiro; Argentina. Comisión Nacional de Energía Atómica. Centro Atómico Bariloche; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas; Argentina-
dc.formatapplication/pdf-
dc.formatapplication/pdf-
dc.languageeng-
dc.publisherComplex Systems Publications-
dc.relationinfo:eu-repo/semantics/altIdentifier/url/http://www.complex-systems.com/abstracts/v24_i02_a03/-
dc.rightsinfo:eu-repo/semantics/openAccess-
dc.rightshttps://creativecommons.org/licenses/by-nc-sa/2.5/ar/-
dc.sourcereponame:CONICET Digital (CONICET)-
dc.sourceinstname:Consejo Nacional de Investigaciones Científicas y Técnicas-
dc.sourceinstacron:CONICET-
dc.subjectINFORMATION-
dc.subjectCOOCURRENCE-
dc.subjectWORDS-
dc.subjectLANGUAGE-
dc.subjectAstronomía-
dc.subjectCiencias Físicas-
dc.subjectCIENCIAS NATURALES Y EXACTAS-
dc.titleInformation Approach to Co-occurrence of Words in Written Language-
dc.typeinfo:eu-repo/semantics/article-
dc.typeinfo:eu-repo/semantics/publishedVersion-
dc.typeinfo:ar-repo/semantics/articulo-
Aparece en las colecciones: CONICET

Ficheros en este ítem:
No hay ficheros asociados a este ítem.