Subject Taxonomy for the Media - the successor of the Subject Codes
Media Topics is a 1100-term taxonomy with a focus on categorizing text. It was released in 2010 and is a development based on the IPTC Subject Codes.
The Media Topics vocabulary can be viewed on the IPTC Controlled Vocabulary server at http://cv.iptc.org/newscodes/mediatopic In addition it can be downloaded as NewsML-G2 Knowledge Item, RDF/XML or RDF/Turtle document, please read the Guidelines. A more user-friendly tree-like view is also available.
Media Topics and Subject Codes
The development of Media Topics started with the Subject Codes vocabulary, extended the tree from 3 to 5 levels and reused the same 17 top level terms. The lower level terms have been revised and rearranged. Each Media Topic provides a mapping back to one of the Subject Codes.
IPTC was looking for linguists to write classification rules for EXTRA https://iptc.github.io/extra/overview.html, an open source rules-based classification engine for news and found them now. The linguist will write Boolean rules to analyze the text of news articles and suggest the most relevant IPTC Media Topics (http://cv.iptc.org/newscodes/mediatopic), a news taxonomy of roughly 1,000 subjects. A portion […]