CLIFF-CLAVIN parses news articles and pulls out people, organizations and places mentioned. A number of tools do this, so why did we create CLIFF-CLAVIN? We’ve built on those tools to add disambiguation tailored to the ways news articles are written, and a concept of “focus” that tries to get at what place an article is really about (as opposed to all the places it mentions). We create CLIFF-CLAVIN to help drive the Media Cloud suite of media analysis tools, but are sharing it in hopes that others find it useful.
- Details: ourcup.info
- Docker “quick” install on GitHub
- Python API client on GitHub
- Source code on GitHub
- D’Ignazio, C., Bhargava, R., Zuckerman, E., & Beck, L. (2014). CLIFF-CLAVIN: Determining geographic focus for news. In NewsKDD: Data Science for News Publishing, at KDD 2014. New York, NY, USA.