Bande Dessinee Corpus Hermeticum

Une bande dessinée est appelée « bande dessinée numérique » à partir du moment où elle a été mise à disposition sur le web. Une définition bien large qui laisse donc des possibilités infinies de ...

I would read in the BCC corpus frequency list as a dictionary, then Having concatenated all the news/magazine articles as plain text, I would build a dictionary of all the words in the news/magazine articles up to 8 characters long, counting their number of occurrences with the help of the BCC frequency list (which tells us which combinations ...

Word frequency list based on a 15 billion character corpus: BCC (BLCU ...

Bande Dessinee Corpus Hermeticum 3

I guess in my case, I could go with per-corpus flashcard sets to keep the per-corpus tagging, and one user dictionary (without tags) with all the per-corpus ranking info included in one entry per term.

Hello Mike, it occurred to me that it may be worthwhile to add an indicator for the frequency of a word in the upper right corner of a dictionary definition using the frequency data in the BCC corpus, allowing the user to see at a glance how common a word is. The frequency information could be...

The Beijing Language and Culture University created a balanced corpus of 15 billion characters. It’s based on news (人民日报 1946-2018,人民日报海外版 2000-2018), literature (books by 472 authors, including a significant portion of non-Chinese writers), non-fiction books, blog and weibo entries as well as...

Bande Dessinee Corpus Hermeticum 6

With a small corpus of 650 articles from People's Daily, downloaded using a Python script, I hope to start providing a more modern frequency list of media-related vocabulary. The frequency list has the following features: It uses all sections of the 人民日报 / People's Daily newspaper, including the sports section.

Bande Dessinee Corpus Hermeticum 7