Skip to main content
v2026.1Open SourceApache 2.0

Viglet Dumont DEP

Viglet Dumont DEP is an open-source data extraction platform. It connects your content — websites, databases, file systems, AEM, and WordPress — to Viglet Turing ES and other search engines through a reliable, asynchronous processing pipeline.

📄

Complete Documentation — v2026.1

All guides, connectors reference and configuration in a single portable PDF

⬇ Download PDF

Key Capabilities
🌐
Web Crawler

Recursively crawl websites with URL filtering, authentication, locale detection and incremental change detection.

🗄️
Database Connector

Run any SQL query against Oracle, PostgreSQL, MariaDB or MySQL and index each row as a searchable document.

📁
FileSystem Connector

Walk directory trees and extract text from PDFs, Word, Excel, PowerPoint and images (OCR) via Apache Tika.

📦
AEM Connector

Index AEM author and publish content with delta tracking, locale mapping and custom extension points.

📝
WordPress Connector

Pull posts, pages and custom content types from WordPress installations into any search engine.

🔌
Multi-Target Indexing

Deliver content to Turing ES (default), Apache Solr or Elasticsearch via pluggable indexing adapters.

Quick Start
Connectors
Architecture & Pipeline
Technical Reference