基于“讲述中国故事”日语文本的专用分词词表构建研究——以《人民网日文版》为例
A Study on Constructing a Custom Word Segmentation Dictionary for Japanese Texts Based on “Telling China’s Story”—A Case Study of People’s Daily Japanese Edition作者机构:重庆交通大学外国语学院重庆
出 版 物:《现代语言学》 (Modern Linguistics)
年 卷 期:2024年第12卷第8期
页 面:489-495页
学科分类:0502[文学-外国语言文学] 05[文学] 050205[文学-日语语言文学]
主 题:讲好中国故事 日语文本 日语分词 词表构建 人民网日文版
摘 要:用日语讲述中国故事、传播中国声音是面向日本构建中国国际形象的重要一环,也是国内日语教育的主要目标之一。《人民网日文版》正是国家对日宣传的重要媒介,可用作培养讲述中国故事日语人才的教学资源。为分析该媒体的对日新闻文本中国形象宣传现状,充分挖掘其教育教学功能,收集《人民网日文本》的新闻文本并构建语料库开展量化分析,是较为有效的研究路径。准确的分词结果是量化分析日语语料的前提,但研究发现,目前的日语分词工具难以处理中国故事日语文本的精确分词,将严重影响分析结论的可靠性。因此,本研究抽取《人民网日文版》新闻文本中与中国社会、经济、文化和科技等相关的日语表述,构建适用于中国故事日语文本的专用分词词表,并评测该词表的实用效果。Telling China’s story and spreading China’s voice in Japanese are crucial for shaping China’s international image in Japan and are also one of the primary goals of domestic Japanese language education. The People’s Daily Japanese Edition is an important medium for China’s stories towards Japan and can serve as a valuable resource for training Japanese language talents to tell China’s story. To analyze the current state of China’s image publicity in this media’s news texts and to fully exploit its educational functions, collecting and constructing a corpus of these news texts for quantitative analysis is an effective research approach. Accurate word segmentation is a prerequisite for the quantitative analysis of Japanese corpora. However, research has found that current Japanese word segmentation tools struggle to precisely segment texts related to China’s story, which significantly affects the reliability of the analysis results. Therefore, this study extracts Japanese expressions related to Chinese society, economy, culture, and technology from the People’s Daily Japanese Edition news texts, constructs a custom segmentation word dictionary for these texts, and evaluates the accuracy and practicality of this dictionary.