I am encountering an odd issue: external search engines (such as Google, or my institution’s website search engine, which is powered by Google) are returning pages from my institution’s Omeka instance in the various Output Formats rather than as normal web pages. The most common result that pops up is the “omeka-xml” output. Is there a way to block those outputs from showing up in search engines? And if not, would removing those Output Formats from the website help alleviate this issue?

Here is an example from my institution’s website: https://www.sjc.edu/search?q=marquis+de+lafayette#gsc.tab=0&gsc.q=marquis%20de%20lafayette&gsc.page=1.

I wanted to post this topic again to see if anyone has any suggestions…

A robots.txt rule should do it. Something like the following, in a robots.txt file in your site root:

User-Agent: *
Disallow: /*?*output=omeka-xml*
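
If you want to sanity-check which URLs a rule like that would catch before deploying it, here is a rough Python sketch. It assumes Google-style wildcard matching (`*` matches any run of characters, a trailing `$` anchors the end of the URL), which is what Google's crawler documents; other crawlers may behave differently. As far as I know the standard library's `urllib.robotparser` does not understand wildcards, so this uses a small hand-rolled matcher. The function names and the sample paths are made up for illustration.

```python
import re

def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path rule into a regex (assumes Google-style wildcards)."""
    anchored = rule.endswith("$")      # trailing '$' means "URL must end here"
    if anchored:
        rule = rule[:-1]
    # Escape the literal pieces and let each '*' match any run of characters.
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))

def is_blocked(path_and_query: str, rule: str = "/*?*output=omeka-xml*") -> bool:
    """Return True if the Disallow rule matches the given path plus query string."""
    return rule_to_regex(rule).match(path_and_query) is not None

if __name__ == "__main__":
    # Hypothetical paths, for illustration only.
    samples = [
        "/items/show/1?output=omeka-xml",
        "/items/browse?page=2&output=omeka-xml",
        "/items/show/1?output=atom",
        "/items/show/1",
    ]
    for path in samples:
        print(path, "->", "blocked" if is_blocked(path) else "allowed")
```

Note that the rule above only covers omeka-xml. If the other output formats are showing up too, a broader pattern such as `/*?*output=` might cover them all at once, but try it against your own URLs first, e.g. `is_blocked(path, "/*?*output=")`.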

