Intercept media upload?

symac · May 22, 2018, 4:12am

Hello,
currently porting our digital library to omeka-s, there are some modules that we need to upgrade to the new system. One of them is ExtractOcr which one of the goals is to extract the OCR from pdf files when they are added to the library.

To achieve this, the hookBeforeSaveFile function is being used. So far I have not been able to find a way to reproduce that using the ingester system of omeka-s.

Am I missing something or the only way would be to add a custom ingester?

Daniel_KM · May 22, 2018, 5:08am

Depending on your process, you can use api.create.pre or api.hydrate.pre (or post if you launch a job). See https://omeka.org/s/docs/developer/reference/events:

            $sharedEventManager->attach(
                \Omeka\Api\Adapter\MediaAdapter::class,
                'api.create.pre',
                [$this, 'createMedia']
            );

Daniel_KM · May 22, 2018, 5:12am

Note that there is PdfText to extract text too (see https://daniel-km.github.io/UpgradeToOmekaS for all upgraded modules or in progress).

symac · May 22, 2018, 6:44am

Thanks @Daniel_KM I should have thought about looking at source of PdfText, that’s exactly what we need. Thanks for the pointer, I had not yet encountered the events listeners when working with omeka-s, that will be extremely useful.

XavierM · June 5, 2018, 9:41am

Hi,

is there a triggered event after add media to an item from the edit page ?

In this example :

$sharedEventManager->attach(
                \Omeka\Api\Adapter\MediaAdapter::class,
                'api.create.post',
                [$this, 'createMedia']
            );

‘createMedia’ is only triggered when i use directly the omeka api, it doesn’t work
after a classic import of media.

Is it an issue or it’s common ?

Thanks.

XavierM · June 5, 2018, 1:22pm

github.com

bubdxm/Omeka-S-module-PdfToc/blob/93ee21e581221ca7a9350e36e9eae6169f62ac86/Module.php#L52


/**
 * Attach listeners to events.
 *
 * @param SharedEventManagerInterface $sharedEventManager
 */
public function attachListeners(SharedEventManagerInterface $sharedEventManager)
{
    $sharedEventManager->attach(
        'Omeka\Api\Adapter\MediaAdapter',
        'api.create.post',
        [$this, 'extractToc']
    );
}




public function extractToc(\Zend\EventManager\Event $event)
{
    $response = $event->getParams()['response'];
    $media = $response->getContent();
    $fileExt = $media->getExtension();

Daniel_KM · June 5, 2018, 1:51pm

When you add a media from the edit page, itemAdapter is triggered and it manages the media directly (so, the api events are not triggered for media: see item adapter).

XavierM · June 5, 2018, 1:59pm

So, i need to browse all media in $item->getMedia() and when i found a pdf media i launch my job?

Daniel_KM · June 5, 2018, 2:04pm

This is like that ArchiveRepertory works. But depending on your job, you may use other events related to the entity (media in your case) “entity.persist.pre”, but it is more complex to manage.

XavierM · June 5, 2018, 2:11pm

Thanks for your help.