A Spellchecker for Assamese documents has been developed. It employs
several techniques viz. edit distance, n-gram and soundex encoding schemes. Two
versions of the spell checker are available – (i) standalone and (ii) integrated
version with MSWord.
Spell Checker(standalone): The Resource Centre has
developed a standalone version of Assamese spellchecker. The spellchecker
employs a number of techniques for detection and correction of Assamese
documents viz. edit distance, n-gram and soundex encoding techniques. (more
Features of the standalone
- GUI developed using Perl/Tk,
with simple text editing facilities
- Currently supports C-DAC’s
- Assamese text files can be
loaded onto the GUI, edited and saved.
- Misspellings can be marked
by clicking the ‘Det’ (‘Detect’) and then the ‘Show’ button.
- Selecting the misspelt word
and clicking ‘Sug’ (‘Suggest’) button can generate suggestions.
- A facility to add new words
to the dictionary exists.
- ‘Select All/Unselect All’
option available in the ‘Edit’ menu on right clicking on the text, for
selecting/deselecting entire text.
- ‘Copy’/’Cut’ and ‘Paste
operations possible for text within the GUI window.
- Text can be copied/cut from
an I-leap document and pasted onto the GUI window.
Assamese Spellchecker (Integrated
The standalone version of the spellchecker is integrated with Microsoft Word
2000. The two separate entities have been integrated using PerlCtrl, which
allows sharing of data and code using ActiveX technology.
Features of the standalone spellchecker are:
Supports C-DAC font
(AS-TTDurga.ttf) and also the font developed by the Resource center (Asamiya.ttf)
marked by curved lines
Suggested words for
misspellings are displayed in a separate window
‘Paste operations possible for text within the GUI window.