Grabbing Exciting embedded Metadata from URLS for items

kria · October 29, 2019, 12:19am

Building a Collection of Items in my Omeka site. I wondered if there was a way to pull metadata from news articles (I have many in this collection), instead of having to manually enter each myself?

For ex: Medium.com has you fill out a lot of information on your article before you post (date, attribution, tags etc). Is it like that for most media articles and can we scrape that like Omeka does for Youtube video links?

Thanks so much. You’re saving me a lot of time.

jflatnes · November 12, 2019, 7:18pm

A plugin can do this (that’s what’s happening with Youtube for you, I think), but out of the box there’s not “automatic” support for pulling metadata from a link like that.

Part of the issue is that different sites often describe themselves very differently, so it’s hard to do that in a generic way.

kloor · November 14, 2019, 2:07pm

I have a plugin that may be of use for you:

It was mainly written to make embedding multimedia content into Omeka easier, but it also does pull some metadata. You may want to give it a shot to see if it helps.

It uses another package to actually retrieve the information from a resource. You can see a demo of that package for any URL here:
https://oscarotero.com/embed3/demo/

I probably need to package a new release with the updates to that dependency, and I’m open to any suggestions if you use it.

kria · November 14, 2019, 11:23pm

@kloor Thank you! I’ll give it a try now.

system · July 21, 2020, 11:23pm

This topic was automatically closed 250 days after the last reply. New replies are no longer allowed.