Talk

Indexing your office documents with Elastic stack and FSCrawler

Byte Size
Data & AI

You have plenty of documents (Open Office, Microsoft Office, PDF, images...) and you want to be able to search their metadata and content. How can you do that?


In this talk, David will explain how Apache Tika can be used to do this and how to combine this fantastic library with the Elastic Stack:


* Elasticsearch ingest-attachment plugin

* FSCrawler

* Workplace Search connector for FSCrawler, which provides a ready-to-use and powerful user interface for your documents.

Scheduled on Wednesday from 13:15 to 13:30 (Europe/London) in Gallery Hall

Elasticsearch
OCR
Enterprise

David Pilato

Elastic

David Pilato is a Developer and Evangelist at Elastic and the creator of the French-language User Group. In his free time, he likes talking about Elasticsearch at conferences or in companies (Brown Bag Lunches, AKA BBLs).