site stats

Subtlex-ch语料库

Web2 Jun 2010 · Our results confirm that word frequencies based on subtitles are a good estimate of daily language exposure and capture much of the variance in word processing … Web14 Jul 2015 · In Chinese, the new frequency that correlated the strongest with Subtlex-CH was Twitter (.879). Overall, the results showed that when we want to use a frequency similar to books or HAL, it is better to use blog frequencies. When we want to use a frequency similar to spoken or subtitle frequencies, it is better to use Twitter frequencies.

(PDF) SUBTLEX-CH: Chinese word and character frequencies

Web2 Jun 2010 · SUBTLEX-CH covered 200 of the 201 words, while LCSMCS covered 189 words, and LCMC covered 199 words. We ran the correlation and regression analyses on the 187 words that were covered by all frequency measures. Correlations between the RTs and the frequencies were -.654 for SUBTL_logW, -.654 for SUBTL_logW-CD, -.370 for … Web2 Jun 2010 · Examination of SUBTLEX-GR, a subtitled-based corpus consisting of more than 27 million Modern Greek words, showed that frequencies estimated from a subtitle … meredith cranmer https://felixpitre.com

[PDF] SUBTLEX-CH: Chinese Word and Character Frequencies …

Web20 Mar 2024 · SUBTLEX-CAT is a word frequency and contextual diversity database for Catalan, obtained from a 278-million-word corpus based on subtitles supplied from broadcast Catalan television. Like all previous SUBTLEX corpora, it comprises subtitles from films and TV series. In addition, it includes a wider range of TV shows (e.g., news, … WebSee SUBTLEX-CH for word frequencies based on Chinese subtitles. See SUBTLEX-ESP for word frequencies based on Spanish subtitles. See SUBTLEX-DE for word frequencies … meredith crane capital health

语料库 - 维基百科,自由的百科全书

Category:语料库 - 维基百科,自由的百科全书

Tags:Subtlex-ch语料库

Subtlex-ch语料库

Effects of Character and Word Contextual Diversity in Chinese Beginning …

Web7 Mar 2012 · The SUBTLEX-US corpus has been parsed with the CLAWS tagger, so that researchers have information about the possible word classes (parts-of-speech, or PoSs) of the entries. Five new columns have ... WebThe second release of CELEX contains an enhanced, expanded version of the German lexical database (2.5), featuring approximately 1,000 new lemma entries, revised morphological …

Subtlex-ch语料库

Did you know?

Web英国国家语料库(British National Corpus)是目前世界上非常有代表性的当代英语语料库之一,由英国牛津出版社、朗文出版公司、牛津大学计算机服务中心、兰卡斯特大学英语计算机中心以及大英图书馆等联合开发建立。. 以来源广泛的书面语和口语为样本,呈现了 ... Web你一定要收藏的语料库资源. 、提及语料库,学语言的童鞋们一定不陌生。. 这些语言材料的大集合不仅能帮助我们研究语言的各种现象,还能在计算机辅助翻译工具中辅助我们的翻 …

WebThe character frequency ranged from 19.9 to 8881.9 per million (mean = 1033.4 per million), which were assessed according to the SUBTLEX-CH frequency list (Cai and Brysbaert, 2010). The other 15 ... Web2 Jun 2010 · SUBTLEX is a zipped file including three files (SUBTLEX-CH-WF, SUBTLEX-CH-CHR, SUBTLEX-CH-WF_PoS) providing word and character frequency measures based on …

Web2 Jun 2010 · This study presents a subtitle-based word frequency list for Spanish, one of the most widely spoken languages, and finds that the subtitle frequencies explained 6% more of the variance than the existing written frequencies in lexical decision, and 2% extra in word naming. 160. PDF. View 2 excerpts, cites methods. WebSUBTLEX-CH-WF.zip : download the word frequencies. SUBTLEX-CH-WF_PoS.zip : d ownload the word frequencies according to the different syntactic roles of the words. We …

Web30 Dec 2010 · subtlex-ch提供基于影视字幕语料库的简体中文词频和字频。 与日渐增长的研究需求相比,可获取的中文词频资源匮乏,尤其是多字词的词频资源。 因此,我们建立 …

Web知乎,中文互联网高质量的问答社区和创作者聚集的原创内容平台,于 2011 年 1 月正式上线,以「让人们更好的分享知识、经验和见解,找到自己的解答」为品牌使命。知乎凭借认 … meredith crawford musicWebHack Chinese™ Official. All Lists /. Frequency Lists / SUBTLEX-CH Words. SUBTLEX-CH Words. Chinese word frequencies based on subtitles. Words 1-100 Words 101-200 Words 201-300 Words 301-400 Words 401-500 Words 501-600 Words 601-700 Words 701-800 Words 801-900 Words 901-1000 Words 1001-1100 Words 1101-1200 Words 1201-1300 … how old is sofia the first nowWeb7 Mar 2024 · 1.打开页面进入北京大学中国语言文学研究中心选择古汉、现汉,可根据需要选择进入普通、批量、模式查询检索。. 2.CCL语料库语料分类分布情况、语料库文件详细 … meredith credit unionWeb23 Dec 2010 · 基于中文詞匯word naming和lexical decision的實驗數據,与现存幾個词频表的詞頻进行了比较,显示這些詞頻对RT的解释作用最优。. 這里我們提供三個頻率表的完 … meredith craighttp://crr.ugent.be/programs-data/subtitle-frequencies/subtlex-ch howoldissolangeWebSUBTLEX-UK: A cleaned Excel file with word frequencies for 160,022 word types (also available as a text file). This file is ideal for those who want to use British word … meredith cox scalesWeb2 Jun 2010 · The character frequency ranged from 19.9 to 8881.9 per million (mean = 1033.4 per million), which were assessed according to the SUBTLEX-CH frequency list (Cai and Brysbaert, 2010). The other 15 ... meredith creek richmond va