Below is a partial screenshot of an American English corpus. What do the column headers in the first row mean, for example CDcount?

Anonymous user
2013-09-24
They are just statistical measures.
FREQcount. This is the number of times the word appears in the corpus (i.e., out of a total of 51 million words).
CDcount. This is the number of films in which the word appears (i.e., it has a maximum value of 8,388).
FREQlow. This is the number of times the word appears in the corpus starting with a lowercase letter. This allows users to further match their stimuli.
CDlow. This is the number of films in which the word appears starting with a lowercase letter.
SUBTLWF. This is the word frequency per million words. It is the measure you would preferably use in your manuscripts, because it is a standard measure of word frequency independent of the corpus size. It is given with two digits precision, in order not to lose precision of the frequency counts.
Lg10WF. This value is based on log10(FREQcount+1) and has four digit precision. Because FREQcount is based on 51 million words, the following conversions apply for SUBTLEXUS:

http://expsy.ugent.be/subtlexus/
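The two derived measures can be sketched from FREQcount alone. This is a rough illustration, not the corpus authors' code: it assumes the rounded corpus size of 51 million words quoted above, and the function names `subtlwf` and `lg10wf` are made up for this example.

```python
import math

# Assumption: the rounded "51 million words" figure from the answer above.
CORPUS_SIZE_MILLIONS = 51.0


def subtlwf(freq_count: int) -> float:
    """Frequency per million words, kept to two decimal places."""
    return round(freq_count / CORPUS_SIZE_MILLIONS, 2)


def lg10wf(freq_count: int) -> float:
    """log10(FREQcount + 1), kept to four decimal places."""
    return round(math.log10(freq_count + 1), 4)


# Print both measures for a few raw counts.
for count in (9, 99, 999, 9999):
    print(count, subtlwf(count), lg10wf(count))
```

Because the log is taken on the raw count plus one, a word that appears 9 times gets Lg10WF 1.0, one that appears 99 times gets 2.0, and so on.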
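Follow-up question
People say you need to master 2,700 words, the ones that appear most frequently in American TV shows. How can I get those 2,700 words listed? Do I select the first column, FREQcount, and sort it in ascending order? Thanks for your reply.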
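One way to approach this: sort on FREQcount in descending order (not ascending), so the most frequent words come first, then keep the first 2,700 rows. The sketch below assumes the data has been exported as tab-separated text and that the word column is named `Word`. Both are assumptions, and the sample rows are invented for illustration, not real SUBTLEXus values.

```python
import csv
import io

# Made-up rows for illustration only; not real SUBTLEXus values.
SAMPLE_TSV = (
    "Word\tFREQcount\tCDcount\n"
    "alpha\t120\t40\n"
    "beta\t3000\t900\n"
    "gamma\t45\t30\n"
    "delta\t800\t350\n"
)


def top_words(tsv_file, n, column="FREQcount"):
    """Return the n words with the highest value in `column`."""
    rows = list(csv.DictReader(tsv_file, delimiter="\t"))
    # Descending sort: highest counts first.
    rows.sort(key=lambda row: int(row[column]), reverse=True)
    return [row["Word"] for row in rows[:n]]


print(top_words(io.StringIO(SAMPLE_TSV), 2))  # ['beta', 'delta']
```

With the real file, the same function could be called on an opened file object with `n=2700`. Sorting by CDcount instead (passing `column="CDcount"`) would rank words by how many films they appear in rather than by raw count.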