Search for content and authors
 

Long-range dependences in natural language

Jarosław Kwapień 1Stanisław Drożdż 2,3Sonia Bryłka 4Łukasz Daros 4Rafał Janik 4

1. Polish Academy of Sciences, Institute of Nuclear Physics (IFJ PAN), Radzikowskiego 152, Kraków 31-342, Poland
2. University of Rzeszów, Institute of Physics, Department of Complex Systems, Rejtana 16, Rzeszów 35-310, Poland
3. Polish Academy of Sciences, Institute of Nuclear Physics (IFJ PAN), Radzikowskiego 152, Kraków 31-342, Poland
4. AGH University of Science and Technology, Faculty of Physics and Applied Computer Science (AGH), Mickiewicza 30, Kraków 30-059, Poland

Abstract

Natural language is an emergent phenomenon formed during the process of historical (and personal) self-organization of human brain. It is intuitively expected that properties of natural language reflect to some extent the properties of the brain, especially its methods of information processing and its ability to create messages. However, since this at present cannot be observed directly, one has to select and inspect observables which are easier to analyze. Among such observables related to language are texts.

In order to faciliate studies, the texts can be transformed into symbolic sequences using various mappings like, e.g., lengths of words, number of words in sentences, frequency ranks of words etc. Such sequences can then be studied with standard techniques of time series analysis. Here we look for the nonlinear long-range correlations in literary works of different authors and written in different languages. Literary texts are the ones which can be expected to express human thoughts in the richest manner. Thus, the correlations - if present - should primarily be detectable just there. We apply both the nonlinear variants of autocorrelation function and the multifractal formalism to identify such correlations and review some of the preliminary results of our study.

 

Auxiliary resources (full texts, presentations, posters, etc.)
  1. PRESENTATION: Long-range dependences in natural language, PDF document, version 1.4, 0.8MB
 

Legal notice
  • Legal notice:
 

Related papers

Presentation: Oral at 5 Ogólnopolskie Sympozjum "Fizyka w Ekonomii i Naukach Społecznych", by Jarosław Kwapień
See On-line Journal of 5 Ogólnopolskie Sympozjum "Fizyka w Ekonomii i Naukach Społecznych"

Submitted: 2010-10-12 18:11
Revised:   2010-10-12 18:11