Show simple item record

dc.contributor.authorSallis, Philipen_NZ
dc.contributor.authorKassabova, Dianaen_NZ
dc.date.available2011-04-07T03:05:24Z
dc.date.copyright1997-12en_NZ
dc.identifier.citationSallis, P., & Kassabova, D. (1997). Usenet newsgroups’ profile analysis utilising standard and non-standard statistical methods (Information Science Discussion Papers Series No. 97/11). University of Otago. Retrieved from http://hdl.handle.net/10523/886en
dc.identifier.urihttp://hdl.handle.net/10523/886
dc.descriptionPlease note that this is a searchable PDF derived via optical character recognition (OCR) from the original source document. As the OCR process is never 100% perfect, there may be some discrepancies between the document image and the underlying text.en_NZ
dc.description.abstractThe paper explores building profiles of Newsgroups from a corpus of Usenet E-mail messages employing some standard statistical techniques as well as fuzzy clustering methods. A large set of data from a number of Newsgroups has been analysed to elicit some text attributes, such as number of words, length of sentences and other stylistic characteristics. Readability scores have also been obtained by using recognised assessment methods. These text attributes were used for building Newsgroups’ profiles. Three newsgroups, each with similar number of messages were selected from the processed sample for the analysis of two types of one-dimensional profiles, one by length of texts and the second by readability scores. Those profiles are compared with corresponding profiles of the whole sample and also with those of a group of frequent participants in the newsgroups. Fuzzy clustering is used for creating two-dimensional profiles of the same groups. An attempt is made to identify the newsgroups by defining centres of data clusters. It is contended that this approach to Newsgroup profile analysis could facilitate a better understanding of computer-mediated communication (CMC) on the Usenet, which is a growing medium of informal business and personal correspondence.en_NZ
dc.format.mimetypeapplication/pdf
dc.publisherUniversity of Otagoen_NZ
dc.relation.ispartofseriesInformation Science Discussion Papers Seriesen_NZ
dc.subject.lcshQA76 Computer softwareen_NZ
dc.titleUsenet newsgroups’ profile analysis utilising standard and non-standard statistical methodsen_NZ
dc.typeDiscussion Paperen_NZ
dc.description.versionUnpublisheden_NZ
otago.bitstream.pages17en_NZ
otago.date.accession2011-01-18 19:56:27en_NZ
otago.schoolInformation Scienceen_NZ
otago.openaccessOpen
otago.place.publicationDunedin, New Zealanden_NZ
dc.identifier.eprints1045en_NZ
otago.school.eprintsInformation Scienceen_NZ
dc.description.referencesJ. Bezdek, Pattern Recognition with Fuzzy Objective Function Algorithms (Plenum Press, New York, 1981). N. Gulley and J.-S. Roger Jang, Fuzzy Logic Toolbox User’s Guide, (The MathWorks, Inc., 1996). D. Harman, Overview of the Third Text Retrieval Conference (TREC8), In: D.K. Harman (ed), The Third Text Retrieval Conference, April 1995 (NIST, Gaithersburg, M_D, 1995) C. Harrison, Readability in the Class Room (Cambridge University Press, 1980). T. Kohonen, Exploration of Very Large Databases by Self-Organising Maps, In: Proceedings of the 1997 International Conference on Neural Networks (ICNN’97), Houston, June 1997, (IEEE, June, 1997). T. Kohonen, Self-Organization and Associative Memory, 2nd Edition, (Springer-Verlag, Berlin, 1987) M. Moraes, What is Usenet, Frequently posted article to news:news.announce.newusers, (accessed on 1 July 1997). S. E. Robertson et al, OKAPI at TREC-3. In: D.K. Harman (ed.), The Third Text Retrieval Conference, April 1995 (NIST, Gaithersburg, MD, 1995). H. Thimbleby, Internet, Discourse and Interaction Potential, In: L.K. Yong, L. Herman, Y.K. Leung, and J. Moyes (eds), APCHI’96, Proceedings of The First Asia Pacific Conference on Computer Human Interaction, Singapore, 25-28 June 1996 (Information Technology Institute, Singapore, 1996).en_NZ
otago.relation.number97/11en_NZ
 Find in your library

Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record