基本信息来源于合作网站,原文需代理用户跳转至来源网站获取       
摘要:
This paper investigates a procedure developed and reports on experiments performed to studying the utility of applying a combined structural property of a text’s sentences and term expansion using WordNet [1] and a local thesaurus [2] in the selection of the most appropriate extractive text summarization for a particular document. Sentences were tagged and normalized then subjected to the Longest Common Subsequence (LCS) algorithm [3] [4] for the selection of the most similar subset of sentences. Calculated similarity was based on LCS of pairs of sentences that make up the document. A normalized score was calculated and used to rank sentences. A selected top subset of the most similar sentences was then tokenized to produce a set of important keywords or terms. The produced terms were further expanded into two subsets using 1) WorldNet;and 2) a local electronic dictionary/thesaurus. The three sets obtained (the original and the expanded two) were then re-cycled to further refine and expand the list of selected sentences from the original document. The process was repeated a number of times in order to find the best representative set of sentences. A final set of the top (best) sentences was selected as candidate sentences for summarization. In order to verify the utility of the procedure, a number of experiments were conducted using an email corpus. The results were compared to those produced by human annotators as well as to results produced using some basic sentences similarity calculation method. Produced results were very encouraging and compared well to those of human annotators and Jacquard sentences similarity.
推荐文章
吸附式热泵领域利用F-Term检索的浅析
F-Term
吸附式热泵
快速检索
检索效率
热泵领域利用F-Term的快速检索策略
F-Term
热泵
快速检索
分类系统
Timing and structural controls on skarn-type and vein-type mineralization at the Xitian tin-polymeta
LA-MC-ICP-MS U-Pb dating
Structure control
Tin-polymetallic deposit
SE China
Determination of brominated diphenyl ethers in atmospheric particulate matter using selective pressu
Brominated diphenyl ethers
Atmospheric particulate matters
Selective pressurised liquid extraction
Gas chromatography-mass spectrometry
内容分析
关键词云
关键词热度
相关文献总数  
(/次)
(/年)
文献信息
篇名 Extractive Summarization Using Structural Syntax, Term Expansion and Refinement
来源期刊 智能科学国际期刊(英文) 学科 医学
关键词 Data Extractive SUMMARIZATION Syntactical Structures Sentence Similarity Longest Common SUBSEQUENCE TERM EXPANSION WORDNET Local THESAURUS
年,卷(期) 2017,(3) 所属期刊栏目
研究方向 页码范围 55-71
页数 17页 分类号 R73
字数 语种
DOI
五维指标
传播情况
(/次)
(/年)
引文网络
引文网络
二级参考文献  (0)
共引文献  (0)
参考文献  (0)
节点文献
引证文献  (0)
同被引文献  (0)
二级引证文献  (0)
2017(0)
  • 参考文献(0)
  • 二级参考文献(0)
  • 引证文献(0)
  • 二级引证文献(0)
研究主题发展历程
节点文献
Data
Extractive
SUMMARIZATION
Syntactical
Structures
Sentence
Similarity
Longest
Common
SUBSEQUENCE
TERM
EXPANSION
WORDNET
Local
THESAURUS
研究起点
研究来源
研究分支
研究去脉
引文网络交叉学科
相关学者/机构
期刊影响力
智能科学国际期刊(英文)
季刊
2163-0283
武汉市江夏区汤逊湖北路38号光谷总部空间
出版文献量(篇)
102
总下载数(次)
0
总被引数(次)
0
论文1v1指导