My name is Jessica Hutchinson. I am a Computer Science/Linguistics postgraduate student at
Rhodes University, South Africa, doing a research project in machine
translation of sign languages. My research involves looking at the
various notations used to represent sign languages and
making a comparison in terms of how effectively each of them can be
parsed by a computer. The results of my research I hope will further improve the area of Sign-To-Text translation.
I would like to build a couple of small systems that do
statistical translation from sign notation to English text, for each of
the main notations (starting with Sign Writing, HamNoSys and Stokoe). To
do this, however, I need quite a bit of data, and I have had trouble
finding any kind of parallel corpus for these notations. I am hoping to do the translation on a sentence-by-sentence
basis, so the kind of data I am looking for would be sentences in a sign
language matched to sentences in English. I'm expecting ASL to be most
accessible, and for comparison's sake
would prefer each notation in the same language, but anything will be
Furthermore, data would need to be in some kind of format that a computer would be able to process - images in a pdf file aren't going to work so well!
If anyone can help me to find that data that I need, I would greatly appreciate it.
Research website: http://www.cs.ru.ac.za/research/g09h2318