Chinese text computing |
Jun Da at lingua.mtsu.edu | Home | Academic | Chinese computing | Learning Chinese | CALL | System admin | Contact me |
|
Technical reportUnder preparation! Page last updated: 2010-09-16In the meantime, please refer to my paper (pdf format) for detailed information about this project: Note that the pdf file contains Simplified Chinese characters. You need to have Acrobat Chinese support package installed on your computer in order to view the file properly. For help, check out http://www.adobe.com/products/acrobat/acrrasianfontpack.html. 1. Data collection1.1 Overview 1.3 Sampling method 2. Data processing2.1 The data set 2.2 Procedure 2.2.1 Pre-processing 2.2.2 Segmenting characters 2.2.3 Making bigrams 3. Results3.1 Character frequencies 3.2 Bigram frequencies and other statistics 4. Discussions5. Further information
|