A Statistical Model of Error Correction for Computer Assisted Language Learning Systems

Halizah Basiron

Back

A Statistical Model of Error Correction for Computer Assisted Language Learning Systems

Doctoral Thesis

Open access

A Statistical Model of Error Correction for Computer Assisted Language Learning Systems

Halizah Basiron

Doctor of Philosophy - PhD, University of Otago

University of Otago

2012

Handle:

https://hdl.handle.net/10523/2291

Abstract

computer assisted language learning systems

natural language processing

language modeling

This thesis presents a study in the area of computer assisted language learning systems. The study focusses on the topic of automatic correction of student errors. In the thesis, I will describe a novel statistical model of error correction, trained on a newly gathered corpus of language data from Malaysian EFL learners, and tested on a different group of students from the same population. The main novelty of my statistical model is that it explicitly represents 'corrections' -- i.e. circumstances where a language teacher corrects a student's language. Most statistical models used in language learning applications are simply models of the target language being taught; their aim is just to define the kinds of sentence which are expected in the target language. These models are good at recognising when a student's utterance contains an error: any sentence which is sufficiently improbable according to a model of the target language can be hypothesised to contain an error. But they are not so good at providing suggestions about how to correct errors. In any student's sentence, there are many things which could be changed: the space of possible corrections is too large to be exhaustively searched. A statistical system which explicitly models the incidence of corrections can help guide the search for good corrections. The system which I describe in this thesis learns about the kinds of context in which particular corrections are made, and after training, is able to make quite good suggestions about how to correct sentences containing errors.

Files and links (1)

pdf

BasironHalizah2012PhD.pdfDownload View

Metrics

2519 File views/ downloads

1099 Record Views

Details

Record Identifier: 9926478828001891
Title: A Statistical Model of Error Correction for Computer Assisted Language Learning Systems
Creators: Halizah Basiron
Contributors: Alistair Knott (Advisor / Supervisor)
Anthony Robins (Advisor / Supervisor)
Academic Unit: Computer Science
Publisher: University of Otago
Degree Awarded: Doctor of Philosophy - PhD
Project Type: Thesis - Doctoral
Awarding Institution: University of Otago
Date published ; e-published: 2012
Wikidata ID: Q112465107
Language: English
Resource Type ; Subtype: Doctoral Thesis
Format: application/pdf