|ARK researchers in October 2012.|
Noah's ARK is Noah Smith's informal research group at the Language Technologies Institute, School of Computer Science, Carnegie Mellon University. (The research is formal; the group is informal.) As you may have guessed, our research focuses on problems of ambiguity and uncertainty in natural language processing, including morphology, syntax, semantics, translation, and behavioral/social phenomena observed through language—all viewed through a computational lens.
The acronym is ambiguous; possible interpretations might include Ambiguity Research Kith or Ambiguity Resolution K. or A. R. Kibbutz. With apologies to the Bible and DAGS.
|ARK researchers in April 2009. Picture by Mattt Thompson.|
|Name||Position||Topics||Languages (Spoken and/or Written)||Languages (Researched)||Languages (Hacked In)||Favorite Term of Venery|
|David Bamman||Ph.D. student, LTI||sociolinguistic variation; statistical NLP for computational social science and the humanities||English, Latin, Ancient Greek, Italian (un po), French (un peu), German (ein bisschen), Mandarin Chinese (一点儿)||English, Latin, Ancient Greek, Chinese||Java, Python, Perl||coalition|
|Victor Chahuneau||M.S. student, LTI|
|Chris Dyer||Assistant Professor, LTI & MLD||machine translation, unsupervised learning, text-based forecasting, big data||English, German||Arabic, Chinese, Czech, Dutch, English, French, German, Hungarian, Telugu, Turkish, Urdu, Welsh||C++, Perl, Java||conspiracy|
|Behrang Mohit||Post-doc, CMU-Q||Arabic NLP, machine translation, semantics||English, Persian (Farsi), Arabic||English, Arabic||Java, Python|
|Brendan O'Connor||Ph.D. student, MLD||text analysis and social science||English, German||English, Chinese||R, Awk, etc.||prickle|
|Bryan Routledge||Associate Professor of finance, Tepper||Finance, asset pricing||English, Canadian||N/A||Matlab, R, Stata, Perl, Excel, Cobol||pod|
|Yanchuan Sim||Ph.D. student, LTI||Bayesian graphical modeling, text mining||English, Chinese (Mandarin, Teochew, Cantonese, Hokkien)||English||C/C++, Java, Python||exaltation|
|Tae Yano||Ph.D. student, LTI||NLP in the political domain, rich models of structured NL data (e.g., blogs)||Japanese, English, Spanish, French||English||C, C++, Java, Perl, Python||husk|
|Dani Yogatama||Ph.D. student, LTI||text-driven forecasting||Indonesian, Japanese, English||English, Japanese, French, Spanish||C/C++, Java, Python, Matlab||band|
|Noah Smith||Associate Professor, LTI & MLD||(most of the above)||English, French (un peu)||Arabic, Bulgarian, Czech, English, French, German, Hebrew, Korean, Mandarin, Portuguese, Turkish||LaTeX||parade|