Show simple item record

dc.contributor.advisorCranefield, Stephen
dc.contributor.advisorStanger, Nigel
dc.contributor.authorMandya, Angrosh Annayappan
dc.date.available2012-10-26T01:33:04Z
dc.date.copyright2012
dc.identifier.citationMandya, A. A. (2012). Enhancing Citation Context based Information Services through Sentence Context Identification (Thesis, Doctor of Philosophy). University of Otago. Retrieved from http://hdl.handle.net/10523/2520en
dc.identifier.urihttp://hdl.handle.net/10523/2520
dc.description.abstractOver the years, scientific articles have played a vital role in disseminating scientific knowledge. Typically, the published scientific information builds upon the previous or existing scientific knowledge and citations play a key role in presenting the argument of the article and defining inter-article relationships across articles. Understanding the use of citations within articles and citation relationships across articles is essential for conducting good research. However, the search and browsing capabilities to trace these relationships are currently limited to finding documents citing a given document and providing links to citing documents, and do not provide insights about the reasons for citations either within an article or across articles. Development of information systems capable of identifying these reasons can be helpful in providing useful services for the research community. Against this background, we investigate in this thesis the possibility of identifying contexts associated with sentences in scientific articles and use this information to provide useful citation context based information services. To achieve this objective, we developed an annotation scheme that defines context types for sentences in scientific articles. An inter-rater reliability study was carried out to examine the reliability of our scheme. We achieved an overall agreement of 89.93% among annotators, indicating the acceptability of our scheme. In order to develop a contextual model of sentence context types in scientific article, we developed the Sentence Context Ontology for generating semantic contextual data that was used for developing applications that provide contextual information services. We also developed a text extraction and preparation system for processing scientific documents. Another key aspect of citation context based information services is the ability to automatically identify contexts of sentences. We achieved this by using Conditional Random Fields (CRFs), a sequential probabilistic classifier. We trained the CRF classifier using a training dataset of 1000 paragraphs extracted from 71 research articles and achieved an accuracy of 91% in classifying the sentences according to our proposed scheme. Finally, using the results obtained from the tasks described above, we developed various applications for providing citation context based information services. These included a standalone system and applications using linked data principles and Web APIs.
dc.format.mimetypeapplication/pdf
dc.language.isoen
dc.publisherUniversity of Otago
dc.rightsAll items in OUR Archive are provided for private study and research purposes and are protected by copyright with all rights reserved unless otherwise indicated.
dc.subjectSentence Context Identification
dc.subjectCitation Classification
dc.subjectConditional Random Fields
dc.subjectSequential Classification
dc.subjectCitation Services
dc.titleEnhancing Citation Context based Information Services through Sentence Context Identification
dc.typeThesis
dc.date.updated2012-10-26T01:14:37Z
dc.language.rfc3066en
thesis.degree.disciplineDepartment of Information Science
thesis.degree.nameDoctor of Philosophy
thesis.degree.grantorUniversity of Otago
thesis.degree.levelDoctoral
otago.openaccessOpen
 Find in your library

Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record