Chinese web 5-gram, version 1
"Chinese Web 5-gram Version 1, Linguistic Data Consortium (LDC) catalog number LDC2010T06 and isbn 1-58563-539-1, was created by researchers at Google Inc. It consists of Chinese word n-grams and their observed frequency counts generated from over 800 million tokens of text. The length of the n...
Main Author: | |
---|---|
Corporate Author: | |
Format: | Book |
Language: | Chinese |
Published: |
[Philadelphia, Pa.] :
Linguistic Data Consortium,
[2010]
|
Subjects: |