extract from IPTC MediaTopics Feb 2021

We are pleased to announce the latest release of IPTC NewsCodes, including our main subject vocabulary for news content, IPTC MediaTopics.

This update includes:

New Media Topics terms

The new terms were requested by MediaTopics users Ritzau in Denmark, NTB in Norway and AFP in France.

  • drowning (https://cv.iptc.org/newscodes/mediatopic/20001321)
  • men (https://cv.iptc.org/newscodes/mediatopic/20001328)
  • poisoning (https://cv.iptc.org/newscodes/mediatopic/20001322)
  • sports coaching (https://cv.iptc.org/newscodes/mediatopic/20001323)
  • sports management and ownership (https://cv.iptc.org/newscodes/mediatopic/20001324)
  • sports officiating (https://cv.iptc.org/newscodes/mediatopic/20001325)
  • torture (https://cv.iptc.org/newscodes/mediatopic/20001320)
  • women (https://cv.iptc.org/newscodes/mediatopic/20001327)
  • women’s rights (https://cv.iptc.org/newscodes/mediatopic/20001326)

Retired Media Topics terms

  • accomplishment (https://cv.iptc.org/newscodes/mediatopic/20000497). Use award and prize (20000498) or record and achievement (20000499) instead.
  • people (https://cv.iptc.org/newscodes/mediatopic/20000502). Use more specific terms instead.

Label changes to Media Topics

Please note that we only ever make changes to labels to make the meaning clearer, we never change the meaning of a term.

  • transfer -> sports transaction (http://cv.iptc.org/newscodes/mediatopic/20001148)
  • minister (government) -> minister and secretary (government) (http://cv.iptc.org/newscodes/mediatopic/20000613)
  • “athletics, track & field” -> “athletics” in en-GB and “track and field” in en-US (http://cv.iptc.org/newscodes/mediatopic/20000827)
  • plant -> flowers and plants (http://cv.iptc.org/newscodes/mediatopic/20000507)
  • imperial and royal matters -> royalty (http://cv.iptc.org/newscodes/mediatopic/20000506)

Media Topics hierarchy moves

  • “award and prize” (20000498) and record and achievement (20000499) were moved to the top level “human interest” term because we retired the parent term “accomplishment”
  • birthday (20001238), celebrity (20000505), high society (20000504) and “human mishap” (20000503) were moved to the top level “human interest” term to under the top level “human interest” term because we retired the parent term “people”.

Definition changes in Media Topics

  • Changes under “human interest” branch: animal (20000500), anniversary (20001237), award and prize (20000498), ceremony (20000501), funeral and memorial service (20001235), wedding (20001236), birthday (20001238)
  • Grammar fixes in en-GB and en-US descriptions for 20000037, 03000000, 20000140, 20000215, 20000228, 20000279, 20000321, 20000327, 20000390, 20000426, 20001229, 20001220, 20000504, 20000339, 20000571, 20000575, 20000590, 20000591, 20000600, 20000604, 20000619, 20000630, 20000658, 20000852

Changes to mappings from MediaTopics to other vocabularies

We had a major review of MediaTopic to Wikidata mappings, thanks to Lucy Butcher from Wirecutter (part of The New York Times, an IPTC member) for her contributions. Many terms have had their WIkidata mappings edited or added. In the near future, we are planning to add mappings from Wikidata back to NewsCodes.

Changes to other NewsCodes vocabularies

The Genre vocabulary had a major update, the second half of the review that was started in the February release.

New Genre terms:

  • Live Coverage (http://cv.iptc.org/newscodes/genre/LiveCoverage)
  • Preview (http://cv.iptc.org/newscodes/genre/Preview)

Retired terms:

  • Scener (https://cv.iptc.org/newscodes/genre/Scener) – use From the Scene instead
  • Text only (https://cv.iptc.org/newscodes/genre/Text_only) – Use Transcript and Verbatim instead
  • Update (https://cv.iptc.org/newscodes/genre/Update) – Use Synopsis or Briefing instead
  • Wrap (https://cv.iptc.org/newscodes/genre/Wrap) – Use Synopsis or Briefing instead
  • Wrapup (https://cv.iptc.org/newscodes/genre/Wrapup) – Use Synopsis or Briefing instead

Label (and definition) changes:

  • Daybook -> Planner (https://cv.iptc.org/newscodes/genre/Daybook)
  • Listing of Facts -> Fact Box (https://cv.iptc.org/newscodes/genre/ListingOfFacts)
  • Summary -> Briefing (https://cv.iptc.org/newscodes/genre/Summary)

Definition changes for: Biography, Birth Announcement, Curtain Raiser, Exclusive, Feature, Fixture, Forecast, From the Scene, Interview, Music, Obituary, Opinion, Polls and Surveys, Press Release, Press-Digest, Profile, Program, Question and Answer Session, Quote, Raw Sound, Response to a Question, Results Listings and Statistics, Retrospective, Review, Side bar and Supporting Information, Special Report, Synopsis.

As usual, all changes can be seen:

Please let us know if you spot any problems. If you are an IPTC member you can post issues, questions and suggestions to the NewsCodes Working Group list at iptc-newscodes-dev@groups.io.

Text and Data Mining Reservation Protocol Community Group home pageBrendan Quinn, Managing Director of IPTC, spoke on 20 April 2021 at the regular meeting of the W3C Text and Data Mining Reservation Protocol Community Group.

The Community Group, open to anyone to join, is discussing how to “facilitate a technical protocol to reserve a publisher’s right for content to be made available for text and data mining (TDM). The solution should be capable of expressing the reservation of TDM rights – following the rules set by Article 4 of the new European DSM Directive – and the availability of machine-readable licenses for TDM actors.”

The Community Group is looking at various technologies for representing machine-readable licences, and Brendan presented IPTC’s RightsML as a possible option. Based on W3C’s ODRL, RightsML allows rights holders to specify permissions, prohibitions and constraints on usage of all types of media content, so it may be a good candidate for representing rights around data mining.

Laurent Le Meur, Chair of the TDM Reservation Protocol Community Group and previous contributor to IPTC, presented at the IPTC Autumn Meeting in 2020 to discuss the proposed project.

We are excited to present to IPTC members the full agenda for the IPTC Spring Meeting 2021, taking place online from Monday May 10th to Wednesday May 12th.

We are honoured to have presentations from IPTC members Adobe, BBC, Agence France-Presse (AFP), The New York Times, Bloomberg, Austria Press Agentur (APA) and new member Scribely, along with guest presentations from the World Wide Web Consortium (W3C), Data Language, TV2 Denmark, and YLE Finland.

Themes include

  • metadata for content accessibility;
  • knowledge graphs and semantic technologies in news and media; and
  • trust and credibility, including a presentation by Leonard Rosenthal of the new Coalition for Content Authenticity and Provenance

Plus we will have all our regular presentations from our Working Groups in NewsML-G2, Photo Metadata, Video Metadata, NewsCodes (including Media Topics), News in JSON and Sports. We will also have sessions for our Standards Committee and PR Committee.

There will also be some time allocated each day to member networking. While we can’t match the networking opportunities of an in-person meeting, we will be using some new tools to make networking more interesting and approachable for members.

We are also planning to hold a special webinar the week before the meeting Introducing knowledge graphs for the media, so we can get straight into the interesting content during the member meeting and not spend time introducing the concepts.

All IPTC member organisations are welcome to attend at no cost.

IPTC members can see more information on the Spring Meeting 2021 page in the IPTC Members-Only Zone.

The IPTC Video Metadata Working Group is happy to announce the 1.0 version of the  IPTC Video Metadata Hub User Guide.

The guide introduces IPTC’s Video Metadata Hub recommendation and explains how it can be used to solve metadata management problems in any organisation that processes video content, from news agencies to advertising agencies; libraries, galleries and museums; long-form video producers such as broadcasters and movie studios; and stock video services.

As well as explaining the details of each field in the IPTC Video Metadata Hub standard, it shows through a set of use cases how it can be used in a variety of common scenarios to store rights, descriptive and administrative metadata for video content.

Pam Fisher, group lead, and the IPTC’s Video Metadata Working Group welcome feedback on the document. If your organisation handles video content, please read it and let us know what you think and what can be explained better. Comments can be send via this site’s Contact Us form or to the public Video Metadata Hub discussion list at https://groups.io/g/iptc-videometadata.

The guide can be seen at https://iptc.org/std/videometadatahub/userguide/.

 

extract from IPTC MediaTopics Feb 2021We have just released a new version of IPTC NewsCodes, which includes many changes to Media Topics.

This is the first major update since August 2020 (although we released new versions in September and October 2020 to add translations of new terms).

The changes are detailed below:

New translations for Media Topics

After many requests, we have now added an “en-US” language version, based on a contribution by Jeff Brown of Fourth Estate. Thanks Jeff!

Mostly it simply changes British English words to US English, such as “centre”/”center” and “programme”/”program”, but there are a few more substantive changes around cinema / movies and changing “holiday” to “vacation”. Also where Jeff had suggested changes to definitions, we often changed them for both British and US English.

en-GB will still be the primary language for Media Topics, but we will keep the en-GB and en-US versions in sync as we make changes.

New Media Topics terms

These were suggested by our collaborators from Ritzau via iMatrics, NTB, TT and AFP. Thanks to all.

Please note that the new terms only exist in en-GB and en-US right now, more translations will be added soon.

Update on 15 March: we have now added translations in Danish (thanks to Ritzau and iMatrics), Nowegian (thanks to NTB), Swedish (thanks to TT) and Portuguese for Brazil and Portugal (thanks to Priberam and Lusa).

Update on 12 April: We have now also added Chinese and German translations for these new and updated terms and definitions. Thanks very much to members Xinhua and dpa for their help!

Retired Media Topics terms

  • sports facilities (http://cv.iptc.org/newscodes/mediatopic/20000559 (retired)) – use medtop:20001126 “sport venue” instead
  • inline skating (http://cv.iptc.org/newscodes/mediatopic/20000967 (retired)) – use medtop:20001155 “roller sports” instead

Label changes to Media Topics 

Please note that we only ever make changes to labels to make the meaning clearer, we never change the meaning of a term.

Media Topics hierarchy moves

Definition changes in Media Topics

Changes to other NewsCodes vocabularies

As usual, the changes can be seen:

Please let us know if you spot any problems. If you are an IPTC member you can post issues, questions and suggestions to the NewsCodes Working Group list at iptc-newscodes-dev@groups.io.

We have made it to the end of 2020. And what a year it has been!

A reminder of happier times when we could meet in person – Managing Director Brendan Quinn and IPTC member representatives enjoying dinner at the 2019 Autumn Meeting in Ljubljana, Slovenia. 

The news and media industry has perhaps been affected less than the travel or hospitality industry, but 2020 was still a hugely eventful year for us all professionally and personally. Congratulations on getting through it, and our thoughts go out to those who have suffered in any way this year.

IPTC Events

Of course our member meetings, planned for Tallinn Estonia and New York USA this year, quickly became virtual events held via Zoom. It worked surprisingly well, and even allowed us to bring on some speakers and guests who wouldn’t have been able to attend or present if we had held the events physically.

You can look back at our Spring Meeting blog posts (Day 1, Day 2, Day 3) and the summary of our Autumn Meeting.

The IPTC Photo Metadata Conference was very interesting this year: from our usual small room hosted as part of the CEPIC Congress, we went to a virtual event with over 200 attendees. If you missed it, or want to re-visit, videos of the sessions are available on YouTube.

Standards work

The News in JSON Working Group submitted ninjs 1.3 for approval at the Spring Meeting, which added fields for trust indicators and genres, support for different types of headlines and alternative IDs. The ninjs generator, showing how easy it is to create a ninjs document by filling in a web form, was very popular and was the inspiration for some related tools in other working groups. Since then, the working group has been looking at more features to be included in future versions of ninjs. If you handle news in JSON in any way and you haven’t completed our News in JSON survey, please do it now!

The NewsML-G2 Working Group released NewsML-G2 2.29 in July which added some fields required for the trust and credibility project, and a new NewsML-G2 Generator tool based on the ninjs one. The group also participated in the trust and credibility projects described below. The NewsML-G2 specifications and guidelines documents have now been updated to version 2.29.

The Video Metadata Working Group released Video Metadata Hub 1.3 during the summer, which added fields to track the editing of metadata (as opposed to editing the actual video), parent video identifier, and updated the mappings to EBUCore and EIDR. The group is hard at work on promoting Video Metadata Hub and creating more introductory materials to help new users understand VMHub and why it is useful.

The NewsCodes Working Group published three updates this year, in March, June and August, and a new update will be published very soon. The NewsCodes Guidelines document was released this year, and is already proving useful both for those wishing to learn how to use NewsCodes better and for the Working Group to establish clear guidelines about when and how to add new terms. MediaTopics is now available in 11 languages and we have more translations coming!

The Photo Metadata Working Group has been very busy, with the biggest news of the year being that Google now supports IPTC Photo Metadata to display licensor information in search results, including a link back to the image owner’s “licence this image” page. The feature was launched in beta in February and launched fully in August. We have had great take-up so far, and the interest in the Photo Metadata Conference (with over 200 people registered) showed that the industry was very keen to hear about it. We also launched updates to the GetPMD tool to support new schema.org mappings, and browser plugins for Chrome and Firefox to enable easy viewing of embedded IPTC Photo Metadata in photographs on the web.

The Sports Content Working Group has had its collective head down in 2020, re-thinking the data model for sports results, statistics and performances. We have been taking a semantic view, looking at using RDF as the main data model for sports data which can then be serialised into JSON, XML and other formats. The intention is that this will also bring the model closer to schema.org in the future. We have some RDF and semantic web experts on the group who are helping with the modelling, and are taking a use-case based approach to make sure that we’re designing something that’s both useful and usable.

A discussion group “spun out” from the NewsCodes Working Group to consider Named Entities for News. So far we have had a couple of meetings to discuss our thoughts on maintaining vocabularies for named entities such as people, companies and places, and to study different approaches used by IPTC member organisations and non-members.

An ongoing project that spans several working groups is the work on Trust and Credibility. After publishing a draft guidelines document in April and a webinar that we ran in September, we plan to publish a 1.0 version in the new year.

All of our Working Groups are always looking for new participants, so if you’re interested in any of these areas, please consider joining IPTC and taking part in a working group!

IPTC appearances at conferences and in the media

There weren’t many conferences in the first part of the year as everyone adjusted to working remotely, but in the second half of the year IPTC people made quite a few appearances at other conferences and webinars.

In July, Brendan Quinn and Robert Schmidt-Nia spoke about NewsML-G2 at an Arab States Broadcasting Union metadata workshop. In September, Michael Steidl spoke on a panel with Google and Alamy at the Perpignan photojournalism conference about Google’s “Licensable Images” feature, and Brendan Quinn hosted a webinar about our work in trust and credibility.

In October,  Pam Fisher and Mark Milstein spoke about Video Metadata Hub at the DMLA conference. In November, Brendan Quinn was invited to give a keynote at the  FIBEP World Media Intelligence Congress, speaking to the media monitoring / media intelligence industry who also use quite a few IPTC standards.

Also in November, Bill Kasdorf published a column in Publisher’s Weekly about Media Topics and IPTC Photo Metadata which raised a lot of interest in the publishing industry. In December, Michael Steidl was invited to present a webinar to IPTC member BVPA about IPTC Photo Metadata.

Membership updates

  • We announced the IPTC Startup Membership category in September, and our first Startup Member to join is IMATAG.
  • DATAGROUP Consulting Services joined as a Voting Member.
  • New Associate Members are CBC / Radio Canada, iMatrics, and DeFodi Images.
  • New Individual Members are Margaret Warren and Alison Sullivan.

We’re very happy to have them all on board and joining in the IPTC community!

Some sad news

It was with great shock that we learned in early November that longstanding member Andy Read of BBC had passed away. He was a key contributor in many areas and his friendliness and enthusiasm will be hugely missed. Rest in peace, friend.

Looking forward

It seems that we have come through the worst 2020 could throw at us and things are looking up for 2021. We are already thinking about 2021’s events and how we can learn from 2020 to improve things for members and friends in 2021.

Best wishes for the holiday season from all of us at IPTC.

PS: If you have any questions or thoughts about how IPTC could help you, or if you are interested in talking about joining IPTC, please contact Managing Director, Brendan Quinn at mdirector@iptc.org.

Today we announce the launch of two new browser extensions for viewing IPTC Photo Metadata on web pages.

The GetPMD tool is one of IPTC’s most popular online resources. With the GetPMD tool, users can view the embedded IPTC metadata of any image on the web, whether it was embedded using either the IPTC IIM or the ISO XMP format. But up to now, users must copy and paste an image’s URL into the tool, or install a browser “bookmarklet”.

To make that a little bit easier, we have created the IPTC Photo Metadata Inspector, a simple browser extension that currently works with the Google Chrome and Mozilla Firefox browsers.

With the extension installed, a context menu will appear when you right-click on an image anywhere on the Web, with a menu option, “View IPTC Photo  Metadata.” If you select that option, you will be taken to getpmd.iptc.org where you can see the embedded metadata for that image.

Example of the IPTC Photo Metadata Inspector extension being used on an image on taz.de.

Please note that the Photo Metadata Inspector only works with simple images: it won’t work with embedded video thumbnails or tweets, for example.

The browser extensions are open source, the code is available from the IPTC’s GitHub repository.

Ideas for fixes and new features are welcome.

If you have feedback, please raise an issue on our GitHub repository, post suggestions to the iptc-photometadata@groups.io public discussion list, or contact us via the form on this site.

An example of the NewsML-G2 generator creating a simple NewsML-G2 XML file based on example values in a web form.

We are pleased to announce the release of the NewsML-G2 Generator, a simple tool to help understand the structure and layout of NewsML-G2 files.

To see how easy it can be to create a valid NewsML-G2 file, simply visit https://iptc.org/std/NewsML-G2/generator/, fill in the form and press the button labelled “Show content as NewsML-G2 2.29”.

Then the box below the form will be filled in with a valid NewsML-G2 document.

The tool demonstrates several key features of NewsML-G2:

  • Adding copyright and rights information through the <copyrightHolder/>, <copyrightNotice/> and <usageTerms/> elements
  • Adding news-item metadata via the <itemMeta> container, such as <firstCreated/>, <versionCreated/>, item type (text, audio, video, graphic or composite, selected via a drop-down), publication status (usage, cancelled or withheld, selected via a drop-down)
  • Adding subject metadata using IPTC Media Topics, via a selection with all of the top-level categories enabled. Subjects are added using the <subject/> construct within the <contentMeta> container.
  • Referring to the IPTC catalog that declares standard metadata vocabularies, using the <catalogRef/> tag
  • Adding the body content using embedded NITF. In the future, we will add a radio button so users can select whether to embed the news content using NITF or XHTML, which is the other common format used by IPTC members to mark up news content.

Your test content is never saved and only exists within your browser.

The source code of the generator is available in the NewsML-G2 GitHub repository.

This is a simple 1.0 version, and only scratches the surface of the capabilities of NewsML-G2. It is based on the successful ninjs generator used to demonstrate our ninjs standard, which was launched along with ninjs 1.3 earlier this year.

In the future, we are thinking of adding features such as:

  • Switch between NITF and XHTML for the content body
  • Demonstrate referring to images and video files using <remoteContent/>
  • Switch between using qcodes and URIs for metadata
  • Demonstrate multiple language support in NewsML-G2
  • Demonstrate usage of partMeta to show adding metadata to segments in files, such as audio and video
  • Integrate the tool with the ninjs generator so users can switch between ninjs and NewsML-G2 with one click!

If you have any more ideas, please raise an issue on the GitHub repository, or contact us via the IPTC Contact Us form.

To learn more about NewsML-G2, the global standard used for distributing news content, see our introduction to NewsML-G2, or the NewsML-G2 Guidelines.

schema.org is the technology used by web site owners around the world to make metadata available to search engines and other third-party services. It is widely used to embed machine-readable data in websites for products, store opening times and much more.

It is also used as one of the sources of metadata for the Google search results. The schema.org “license”, “acquireLicensePage” and “creator” properties in a page’s HTML code are used in addition to IPTC Photo Metadata embedded in image files to populate the image panel.

schema.org version 11 was released this week. It contains two new properties on the CreativeWork type (and therefore its subtypes such as ImageObject) that were created to match their equivalent properties in IPTC Photo Metadata: copyrightNotice, which matches the IPTC Photo Metadata Copyright Notice property, and creditText, which matches the IPTC Photo Metadata Credit Line property.

The new fields are not yet supported by Google images search, but hopefully will be soon.

After the recent update, the current properties mapped to schema.org and used in Google images search results are:

IPTC Photo Metadata property Matching schema.org property Used in Google search results?
Creator ImageObject -> creator Yes
Copyright Notice ImageObject -> copyrightNotice Not yet
Credit Line ImageObject -> creditText Not yet
Web Statement of Rights ImageObject -> license Yes
Licensor / Licensor URL ImageObject -> acquireLicensePage Yes

The IPTC Photo Metadata Working Group is working on a more comprehensive document showing all possible IPTC Photo Metadata fields with their schema.org and EXIF equivalents. The full mapping document will be released soon.

Yesterday Michael Steidl, Lead of the IPTC Photo Metadata Working Group, gave a webinar to Bundesverband professioneller Bildanbieter (BVPA), the Federal Association of Professional Image Providers in Germany.

Portrait of Michael W. Steidl, Lead of the IPTC Photo Metadata Working Group

The webinar focused on the recently introduced image license information for Google image searches and the possible opportunities and risks for the professional image business.

“This year, Google introduced the so-called Licensable Badge for its image search. This feature enables images to be linked to license information and to be displayed in the image search results with a corresponding link. Image seekers from advertising, editorial offices and corporate PR can follow the link to obtain further information on how to use the image. This turns Google image search into a potential marketplace. But how can image providers use the new tool for themselves? Is it worth the effort of storing the necessary metadata? Are there any economic risks involved? Will Google soon become a meta picture agency?”

In the first part of the webinar, Michael Steidl explained which image metadata must be stored in order to display photo credits and “licensable” badges on Google. He also informed participants about the problem that certain software and web platforms deletes image metadata after upload.

In the second part, Alexander Karst explains the possibilities for increasing visibility through the new features and gives an assessment of the effects on the image market.

Thanks to BVPA for hosting Michael for the webinar.