I finally completed writing the toolset that I wrote for spellchecker.lu. Those tools parse text files to create text corpora, classify new words using statistical methods, detect common spelling mistakes, generate and upload the dictionary files (as well as the soon-to-be-released debian packages) and they allow me to control and maintain the online checker. All of this is now as simple as it could possibly be. For all of you that always wanted to get a glimpse of what’s going on “behind the scenes” of spellchecker.lu, I prepared a few screenshots of the graphical user interface.
Though this might or might not look fancy to you, the toolset does a damn fascinating job:
This is the starting point of the toolset, giving an overview of all the subroutines.
This picture shows a more enduring process.
For the tech-savvy of you: The toolset is actually a Bash script using dialog for the GUI. The subroutines are implemented with Java, AWK as well as some SED.








