Posts From Category: opensource

Apache Tika Docker Examples

woman reading book whilst drinking coffee

Photo by Priscilla Du Preez on Unsplash

For a number of years I’ve been involved in the Apache Tika project as both a committer and PMC member.

With the increase in container technology usage over the past few years, we spun up a separate repository for Apache Tika Server in Docker, called tika-docker, with convenience images hosted on Docker Hub.

This has resulted in questions on how to customise configuration and host instances that link to other services. To help people get started, we’ve created some example scenarios.

So let’s dive in and check them out.


Apache Tika and the ObjectRecognitionParser for Object Recognition and Captioning Using TensorFlow REST

people looking at a laptop screen

Photo by John Schnobrich on Unsplash

One of the coolest features added to Apache Tika in the past few years is a set of parsers that leverage deep learning to perform object recognition and captioning.

Thanks to contributions from Chris Mattmann and Thejan Wijesinghe, through their work with USC Data Science, you can configure Apache Tika to call out to predefined models and get the deep learning equivalent of ‘Hello World’: tagging dog or cat pictures!

So let’s try it out.
