[1] WEN Yongyi, WANG Zhimei. Construction of Chinese Medical Literature Corpus and Its Critical Issues in the Top-down Design[J]. Western Journal of Traditional Chinese Medicine, 2018, 31(07): 62-65.

Construction of Chinese Medical Literature Corpus and Its Critical Issues in the Top-down Design

Western Journal of Traditional Chinese Medicine [ISSN: 2096-9600 / CN: 62-1204/R]

Volume: 31
Issue: 2018, No. 07
Pages: 62-65
Publication date: 2018-07-15

Article Info

Title: Construction of Chinese Medical Literature Corpus and Its Critical Issues in the Top-down Design
Article ID: 1004-6852(2018)07-0062-04
Author(s): WEN Yongyi, WANG Zhimei
Affiliation: School of Foreign Languages, Shaanxi University of Chinese Medicine, Xianyang 712046, Shaanxi, China
Keywords: Chinese medicine corpus; word segmentation; information extraction; top-down design
Classification number: R222
Document code: A
Abstract:
This paper discusses the construction of a Chinese medical literature corpus from four angles: reconciling the basic characteristics of Chinese medical texts with automatic processing systems; methods for segmenting and annotating specialized medical terms; the difficulties of segmenting non-specialized terms and of choosing a text annotation method; and solutions to these problems. It argues that the fundamental purpose of building such a corpus is data analysis and information extraction. This purpose bears on every stage of corpus building: selecting the editions of the source texts, segmenting and annotating the corpus files, retrieval and information checking, and developing automatic processing tools. Corpus construction is therefore a distinctive systems engineering project. The basic parameters of each subsystem need to be set at the top-level design stage, so that the result as a whole has consistent data types, clearly layered data, well-connected subsystems, and comprehensive, reliable information extraction.

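Memo

Received: 2017-11-03. *Funding: National Social Science Fund of China, 2016 project (No. 16XYY011). About the author: WEN Yongyi (born 1966), male, associate professor. Research interest: Chinese medical literature corpora.
Last Update: 2018-07-15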