.. .. Default format for syntax highlighting is plain text. .. highlight:: text .. _references: ========== References ========== | DuBay (ed.): The Classic Readability Studies; Impact Information, Costa Mesa (2006) | Accessed 2018-04-20 from | http://www.impact-information.com/impactinfo/research/classics.pdf | McLaughlin, G. H.: SMOG Grading - A New Readability Formula. Journal of Reading, May, 1969. | Accessed 2018-04-22 from | https://ogg.osu.edu/media/documents/health_lit/WRRSMOG_Readability_Formula_G._Harry_McLaughlin__1969_.pdf | Geoffrey Marnell: Measuring Readability | Accessed 2018-04-20 from | https://www.abelard.com.au/readability%20statistics.pdf | Grefenstette, G. and Tapanainen, P.: What is a word, what is a sentence? problems of tokenization (1994) | Accessed 2018-04-20 from | http://www.georgetown.edu/faculty/wilsong/IR/grefenstette94.pdf | Kiss & Strunk: Unsupervised Multilingual Sentence Boundary Detection | Computational Linguistics, Volume 32 Issue 4, December 2006, Pages 485-525 | (This is the basis of the Punkt system) | Accessed 2018-04-22 from | https://s3.amazonaws.com/tm-town-nlp-resources/ks2005FINAL.pdf | Also found at: | https://pdfs.semanticscholar.org/438b/468f05de91ce583d5dc504f0fbaa0a598626.pdf | A version that appears to be older is at a site of one of the authors: | http://www.linguistics.ruhr-uni-bochum.de/~kiss/publications/compling2005_KS27.01final.pdf | Jordan Boyd-Graber post on the nltk-users Google group | Accessed 2018-04-20 from | https://groups.google.com/forum/#!msg/nltk-users/mCOh_u7V8_I/HsBNcLYM54EJ | Hettinger, Raymond post of Python Bloom filter code to the ActiveState Python Cookbook | Accessed 2018-04-22 from | http://code.activestate.com/recipes/577684-bloom-filter/ | "Moved" to https://github.com/ActiveState/code/tree/master/recipes/Python/577684_Bloom_Filter | But unfortunately, Hettinger's helpful discussion of the code (at the first URI) has not been transferred to GitHub. I have included it in ``util/Bloom_filter_Hettinger.py`` in the source distribution and the GitHub repo of this package. | Burton H. Bloom: Space/Time Trade-offs in Hash Coding with Allowable Errors, Communications of the ACM, Vol. 13 No. 7, July, 1970, pp. 422-426. | Retrieved 2018-04-22 from | http://www.ece.cmu.edu/~ece447/s13/lib/exe/fetch.php?media=p422-bloom.pdf | Kirsch and Mitzenmacher: Less Hashing, Same Performance: Building a Better Bloom Filter (2006). | Accessed 2018-04-22 from | https://www.eecs.harvard.edu/~michaelm/postscripts/rsa2008.pdf | An earlier (much shorter) version is: | http://astrometry.net/svn/trunk/documents/papers/dstn-review/papers/kirsch2006.pdf | Humphries, Glenn: Syllabification: The Division of Words into Syllables. | Accessed long ago from | http://glenn.humphries.com/Notebook/toc.htm (page no longer available) | Cuayáhuitl, Heriberto: A Syllabification Algorithm for Spanish, in Computational Linguistics and Intelligent Text Processing, 5th International Conference, CICLing 2004 Seoul, Korea, February 15-21, 2004 Proceedings, published as Lecture Notes in Computer Science (LNCS) vol. 2945 (Springer), pp. 412-415. | Accessed 2018-04-20 from | https://pdfs.semanticscholar.org/51a1/5db989f4b62a2beb725d71e5db2d210d6141.pdf | Law, Gwillim: Error in the Fernandez Huerta Readability Formula. LINGUIST List 22.2332. | Accessed 2018-03-25 from | https://linguistlist.org/issues/22/22-2332.html | Barrio Cantalejo, Inés Mª: Legibilidad y salud: los métodos de medición de la legibilidad y su aplicación al diseño de folletos educativos sobre salud. (Doctoral Thesis, 2007) | Accessed 2018-04-22 from | https://repositorio.uam.es/bitstream/handle/10486/2488/3907_barrio_cantalejo_ines_maria.pdf