Chinese text computing
         | | | | | |      
 
 

News corpus

( Page last updated: 2010-09-16 )

Paper

Da, Jun. 2007.The distribution of four-character idioms in Chinese news texts and its implications for CFL learning and instruction. The 6th International Conference on Chinese Language Pedagogy and the First International Conference on Teaching Chinese to American Students. Nanjing, China. (Download pdf)

Da, Jun. 2005. Reading news for information: How much vocabulary a CFL learner should know. International Interdisciplinary Conference on H¨¤nz¨¬ r¨¨nzh¨© - How Western Learners Discover the World of Written Chinese. Germersheim, Germany: August 2005 (Download pdf)

Source

1. Sources of news texts used in this study:
ÊÀ½çÂÛ̳£ºhttp://www.wforum.com/gbindex.html

Frequency distribution

Tools used in this study

GBK character list

Consolidated wordlist based on six online word lists

 

 
Copyright. 1998-2024. Jun Da. jda@mtsu.edu. Page last updated: 2010-09-16