Language article in The Economist
People are passing around the office an
article in the latest edition of The Economist, called "Corpus Colossal", about issues linguists are having in using the Internet as a source for training and research data. Obviously it's a big concern to us too, since we are monster consumers of such data, and finding it for non-English languages is especially tough without the Internet.
We all thought it was funny how the article mentions that computational linguists are being hired by shady web site operators in order to skew their search patterns to get past the screens of search engines.
The article links to the
Language Log, which it describes as an informal place where linguists write.