Here is what to do: - type: telnet unixs.cis.pitt.edu and log in Start downloading the files, unpacking, compiling - up the point where things don't work (which is the call of "Stats"). From there, - copy the attached two files into the ngrams directory (Stats will be replaced by my version) - like in the instructions on the web, change the first line of count-unigrams.pl (on unixs.cis, this is /usr/pitt/bin/perl) - type: chmod +x count-unigrams.pl - type: ./Stats corpora/GEN.EN genesis/out ... and then proceed as explained on your webpage. I hope that works! I have tested this on Solaris 2.6 - and it works fine, Ilya tried it on the unixs machines, where it worked, and on his PC version, where something went wrong.