WebSep 30, 2024 · Chinese and English are rich-resource language pairs, in order to study low-resource cross-lingual machine reading comprehension (XMRC), besides defining the common XCMRC task which has no restrictions on use of external language resources, we also define the pseudo low-resource XCMRC task by limiting the language resources to … WebIntroduced by Sun et al. in Investigating Prior Knowledge for Challenging Chinese Machine Reading Comprehension. C3 is a free-form multiple-Choice Chinese machine reading …
machine-reading-comprehension · GitHub Topics · GitHub
WebJul 8, 2016 · Reading comprehension has embraced a booming in recent NLP research. Several institutes have released the Cloze-style reading comprehension data, and these have greatly accelerated the research of machine comprehension. WebDec 13, 2024 · We present Native Chinese Reader (NCR), a new machine reading comprehension (MRC) dataset with particularly long articles in both modern and classical Chinese. NCR is collected from the exam questions for the Chinese course in China's high schools, which are designed to evaluate the language proficiency of native Chinese youth. how do you spell the girls name lily or lilly
XCMRC: Evaluating Cross-Lingual Machine Reading Comprehension …
WebApr 21, 2024 · In this paper, we present the first free-form multiple-Choice Chinese machine reading Comprehension dataset (C^3), containing 13,369 documents (dialogues or more formally written mixed-genre texts) and their associated 19,577 multiple-choice free-form questions collected from Chinese-as-a-second-language examinations. WebOct 2, 2024 · Two of them are classification datasets, one of them is Name Entity Recognition dataset and the last one is Machine Reading Comprehension dataset. All the dataset used in this task are Chinese and the details about them will be described in Sect. 3. The participants should train a pre-trained model and fine-tune it over … Weblarge-scale, open-domain Chinese ma-chine reading comprehension (MRC) dataset, designed to address real-world MRC. DuReader has three advantages over previous MRC datasets: (1) data sources: questions and documents are based on Baidu Search and Baidu Zhi-dao1; answers are manually generated. (2) question types: it provides rich how do you spell the french word for yes