MuNPEx is a multi-lingual noun phrase (NP) extraction component developed for the GATE architecture, implemented in JAPE. It currently supports English, German, French, and Spanish (in beta).
MuNPEx requires a part-of-speech (POS) tagger to work and can additionally use detected named entities (NEs) to improve chunking performance. Please read the documentation (or source code) for more details.
Alle Programme und Resourcen auf der Liste sind frei, d.h. kostenlos (für Forschungszwecke) verfügbar, auf deutschsprachige Texte anwendbar und sofort startklar, d.h. sie müssen nicht erst mit Hilfe von z.B. annotierten Korpora trainiert werden. Die Liste ist natürlich unvollständig (Stand 22.5.2007).
This file contains the index from English Verb Classes And Alternations: A Preliminary Investigation, by Beth Levin, published by The University of Chicago Press
The TreeTagger is a tool for annotating text with part-of-speech and lemma information which has been developed within the TC project at the Institute for Computational Linguistics of the University of Stuttgart. The TreeTagger has been successfully used to tag German, English, French, Italian, Dutch, Spanish, Bulgarian, Russian, Greek, Portuguese, Chinese and old French texts and is easily adaptable to other languages if a lexicon and a manually tagged training corpus are available.
Online Demo of the TreeTagger. A tool for annotating text with part-of-speech and lemma information which has been developed at the Institute for Computational Linguistics of the University of Stuttgart.
Shalmaneser is a supervised learning toolbox for shallow semantic parsing, i.e. the automatic assignment of semantic classes and roles to text. The system was developed for Frame Semantics; thus we use Frame Semantics terminology and call the classes frames and the roles frame elements. However, the architecture is reasonably general, and with a certain amount of adaption, Shalmaneser should be usable for other paradigms (e.g., PropBank roles) as well. Shalmaneser caters both for end users, and for researchers.
I am investigating computational models for linguistic structures and processes, with application to language technologies and to the documentation of endangered languages. My current focus is on efficient query for databases of hierarchically annotated data. After completing a PhD on computational phonology at the University of Edinburgh in 1990, I worked on a series of European research projects and conducted linguistic fieldwork in Cameroon with SIL. In 1998 I moved to the University of Pennsylvania, becoming Associate Director of the LDC, and working on models and tools for linguistic annotation. In 2002 I returned home to Australia and established the Melbourne University Language Technology Group. In 2007 I was awarded the Kelvin Medal for excellence in teaching.
Key Activities: Coordinating first year Informatics; developing the Natural Language Toolkit; writing a textbook on NLP; leading the Language Technology Group; working on an NSF project on Querying Linguistic Databases; and editing Cambridge Studies in Natural Language Processing and the ACL Anthology.
Key Publications: Natural Language Processing in Python; Computational phonology: A constraint-based approach (Cambridge); A formal framework for linguistic annotation (Speech Communication); Seven dimensions of portability for language documentation and description (Language); Designing and evaluating an XPath dialect for linguistic queries (ICDE).
L. Tang, Y. Zhang, and X. Fu. Proceedings of 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering, page 58- 61. (2005)
T. Pardo, and L. Rino. Proceedings of the 3rd International Conference: Portugal for Natural Language Processing - PorTAL, volume 2389 of LNAI, page 263-273. Faro, Portugal, (June 2002)
T. Pardo, L. Antiqueira, M. Nunes, O. Oliveira Jr., and L. Costa. Proceedings of the International Conference on Communications, Circuits and Systems (ICCCAS'06) - Special Session on Complex Networks, page 2678-2682. Gui Lin, China, UESTC Press, (June 2006)
J. Otterbacher, G. Erkan, and D. Radev. Proceedings of the Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing (HLT/EMNLP), page 915-922. Vancouver, British Columbia, Canada, Association for Computational Linguistics, (October 2005)
V. Nastase, and S. Szpakowicz. Proceedings of TextGraphs: the Second Workshop on Graph Based Methods for Natural Language Processing, page 29-32. New York City, Association for Computational Linguistics, (June 2006)