Some background

 

My background: Very early paper on machine parsing of natural language, 1962. Then a career in theoretical physics. Then on the Illinois biology faculty, UIUC, 1975-1985. At Northeastern U. in the College of Computer Science since 1986. Established the Biological Knowledge Laboratory there in 1989.

My goal is not to treat biological text analysis as a supplement to sequence and related databases, but to pursue text analysis as the primary goal. Text corpora are still far larger than the genome and are both the answer to and the limiting step in our understanding of living systems.