AQMAR Arabic Wikipedia Supersense Corpus

This is a 65,000-token corpus of 28 Arabic Wikipedia articles hand-annotated for nominal supersenses. It extends the Named Entity Corpus and was developed by Nathan Schneider, Behrang Mohit, Kemal Oflazer, and Noah Smith as part of the AQMAR project.

Download

Further Reading

If you publish work that uses the data above, please cite the following:

Acknowledgments

This research was supported by Qatar National Research Fund grant NPRP 08-485-1-083.

Contact

Please e-mail nschneid [strudel] cs.cmu.edu or behrang [strudel] cmu.edu with questions.