![]() |
| ARK researchers in October 2012. |
Noah's ARK[1] is Noah Smith's informal research group at the Language Technologies Institute, School of Computer Science, Carnegie Mellon University. (The research is formal; the group is informal.) As you may have guessed, our research focuses on problems of ambiguity and uncertainty in natural language processing, including morphology, syntax, semantics, translation, and behavioral/social phenomena observed through language—all viewed through a computational lens.
[1]The acronym is ambiguous; possible interpretations might include Ambiguity Research Kith or Ambiguity Resolution K. or A. R. Kibbutz. With apologies to the Bible and DAGS.
![]() |
| ARK researchers in April 2009. Picture by Mattt Thompson. |
| Outdated Photo | Name | Position | Topics | Languages (Spoken and/or Written) | Languages (Researched) | Languages (Hacked In) | Favorite Term of Venery |
|---|---|---|---|---|---|---|---|
![]() | Waleed Ammar | Ph.D. student, LTI | statistical machine translation, text analytics | Arabic, English | English, Arabic, Hebrew, Kinyarwanda | C#, C/C++, Javascript, ruby, Java, PHP, ASP.net | charm ![]() |
![]() | David Bamman | Ph.D. student, LTI | sociolinguistic variation; statistical NLP for computational social science and the humanities | English, Latin, Ancient Greek, Italian (un po), French (un peu), German (ein bisschen), Mandarin Chinese (一点儿) | English, Latin, Ancient Greek, Chinese | Java, Python, Perl | coalition
![]() |
![]() | Victor Chahuneau | M.S. student, LTI | |||||
![]() | Chris Dyer | Assistant Professor, LTI & MLD | machine translation, unsupervised learning, text-based forecasting, big data | English, German | Arabic, Chinese, Czech, Dutch, English, French, German, Hungarian, Telugu, Turkish, Urdu, Welsh | C++, Perl, Java | conspiracy![]() |
![]() | Behrang Mohit | Post-doc, CMU-Q | Arabic NLP, machine translation, semantics | English, Persian (Farsi), Arabic | English, Arabic | Java, Python | |
![]() | Brendan O'Connor | Ph.D. student, MLD | text analysis and social science | English, German | English, Chinese | R, Awk, etc. | prickle![]() |
![]() | Bryan Routledge | Associate Professor of finance, Tepper | Finance, asset pricing | English, Canadian | N/A | Matlab, R, Stata, Perl, Excel, Cobol | pod![]() |
![]() | Nathan Schneider | Ph.D. student, LTI | semantics and its relation to linguistic structure; cognitive linguistics | English, Hebrew (קצת), Arabic (قليل), French (un peu), German (ein bisschen) | English, Hebrew, Arabic | Python, Java, PHP, Scheme, Javascript | smack![]() |
![]() | Yanchuan Sim | Ph.D. student, LTI | Bayesian graphical modeling, text mining | English, Chinese (Mandarin, Teochew, Cantonese, Hokkien) | English | C/C++, Java, Python | exaltation![]() |
![]() | Sam Thomson | M.S. student, LTI | semantic parsing | English, Spanish | English | Python, Java, JavaScript, R | murder ![]() |
![]() | Tae Yano | Ph.D. student, LTI | NLP in the political domain, rich models of structured NL data (e.g., blogs) | Japanese, English, Spanish, French | English | C, C++, Java, Perl, Python | husk![]() |
![]() | Dani Yogatama | Ph.D. student, LTI | text-driven forecasting | Indonesian, Japanese, English | English, Japanese, French, Spanish | C/C++, Java, Python, Matlab | band![]() |
| Noah Smith | Associate Professor, LTI & MLD | (most of the above) | English, French (un peu) | Arabic, Bulgarian, Czech, English, French, German, Hebrew, Korean, Mandarin, Portuguese, Turkish | LaTeX | parade![]() |