A Frequency Dictionary of French by Lonsdale and Le Bras provides structured lists of the most frequent words and is a standard citation for French lexical data.

2. Machine Learning & Summarization (arXiv)
Researchers such as Tomi Klein have cited qualitative results from processing a 215,000-word French text.
A common reference for a dataset of approximately 215,000 words is an academic paper discussing the processing of a work by Lionel Groulx.