Viglet Dumont DEP
Viglet Dumont DEP is an open-source data extraction platform. It connects your content — websites, databases, file systems, AEM, and WordPress — to Viglet Turing ES and other search engines through a reliable, asynchronous processing pipeline.
Complete Documentation — v2026.1
All guides, connectors reference and configuration in a single portable PDF
⬇ Download PDF
Recursively crawl websites with URL filtering, authentication, locale detection and incremental change detection.
Run any SQL query against Oracle, PostgreSQL, MariaDB or MySQL and index each row as a searchable document.
Walk directory trees and extract text from PDFs, Word, Excel, PowerPoint and images (OCR) via Apache Tika.
Index AEM author and publish content with delta tracking, locale mapping and custom extension points.
Pull posts, pages and custom content types from WordPress installations into any search engine.
Deliver content to Turing ES (default), Apache Solr or Elasticsearch via pluggable indexing adapters.