Big Data

The Arpia Big Data System is an administration platform for large volumes of data (more than 200 million documents) in all common digital formats. It provides distributed indexing (storage) of data from third-party applications in a range of formats (XML, JSON, CSV, Word, PDF, etc.) and combines it with real-time search and analysis tools, automated failover, and data recovery.

Unleash the Potential of Your Data with Big Data


Indexing (Storage) of Data

The Arpia Big Data System supports fast batch indexing of up to 200 GB per hour and provides several interfaces for data indexing.

When storing data, the Arpia Big Data System automatically analyzes and structures it and stores it in JSON format to ensure data quality. During all indexing operations, the data is held in a transaction log so that no information is lost.
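The idea of normalizing incoming data to JSON and recording it in a transaction log before indexing can be illustrated with a minimal Python sketch. This is not the Arpia implementation: the names `csv_to_json_docs`, `TransactionLog`, and `index_documents` are hypothetical, the log lives in memory rather than on disk, and the "index" is a plain dictionary.

```python
import csv
import io
import json


def csv_to_json_docs(csv_text):
    """Parse CSV text into one JSON-serializable dict per row (hypothetical helper)."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [dict(row) for row in reader]


class TransactionLog:
    """Append-only log; an in-memory stand-in for a durable log file."""

    def __init__(self):
        self.entries = []

    def append(self, doc):
        # Each entry is stored as one serialized JSON line.
        self.entries.append(json.dumps(doc, sort_keys=True))

    def replay(self):
        # Return every logged document in the order it was written.
        return [json.loads(line) for line in self.entries]


def index_documents(docs, log, index):
    for doc in docs:
        log.append(doc)         # 1. record the document in the log first
        index[doc["id"]] = doc  # 2. then apply it to the index


log = TransactionLog()
index = {}
docs = csv_to_json_docs("id,title\n1,Invoice\n2,Contract\n")
index_documents(docs, log, index)

# Simulate losing the index and rebuilding it from the log alone.
recovered = {doc["id"]: doc for doc in log.replay()}
```

Because each document reaches the log before the index, replaying the log is enough to rebuild the index after a failure.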

Data Search

The Arpia Big Data System offers an internal QueryEngine for comprehensive data search, based on Lucene/Solr. It delivers real-time search, whether you need full-text search or search on specific document fields. The search functionality covers all essential features, including error tolerance (fuzzy matching), highlighting, faceted search, pagination, and geospatial search.
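As a rough illustration of how these features map onto a Lucene/Solr-style request, the sketch below builds a query string from standard Solr parameters (`q`, `start`, `rows`, `facet`, `facet.field`, `hl`, `hl.fl`) and the Lucene fuzzy operator `~`. Whether the Arpia QueryEngine accepts these parameters directly is an assumption, and the function `build_search_query` and the field names `title` and `type` are hypothetical.

```python
from urllib.parse import urlencode


def build_search_query(text, field=None, fuzzy=False, page=0, page_size=10,
                       facet_field=None, highlight_field=None):
    """Build a Solr-style query string (hypothetical helper, not an Arpia API)."""
    # "~1" is Lucene fuzzy syntax: match terms within one edit of the input.
    term = f"{text}~1" if fuzzy else text
    # Restrict the search to one field, or search the default field.
    q = f"{field}:{term}" if field else term
    # Pagination: "start" is the offset of the first result, "rows" the page size.
    params = {"q": q, "start": page * page_size, "rows": page_size}
    if facet_field:
        params["facet"] = "true"
        params["facet.field"] = facet_field
    if highlight_field:
        params["hl"] = "true"
        params["hl.fl"] = highlight_field
    return urlencode(params)


# Third page of fuzzy matches on "invoice" in the title field,
# with facet counts per document type and highlighted titles.
query = build_search_query("invoice", field="title", fuzzy=True, page=2,
                           facet_field="type", highlight_field="title")
```

The resulting string would be appended to the search endpoint URL of whichever interface the system exposes.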

Import / Export

The Arpia Big Data System includes a powerful scheduler that automates internal processes using calendar functions, with either periodic or one-time triggers. You can review the history and results of each process afterward, ensuring full visibility and traceability. The scheduler can be used for the following processes:

  • Data Import/Export (File-based, DB)
  • Data Backup
  • Reindexing or Data Restoration from Backup
  • File Read/Write via SFTP
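The behavior described above (periodic and one-time triggers plus a reviewable run history) can be sketched in a few lines of Python. This is a simplified model under stated assumptions, not the Arpia scheduler: the `Job` and `Scheduler` classes are hypothetical, the current time is passed in explicitly, and the jobs are placeholder functions rather than real backup or reindexing tasks.

```python
from dataclasses import dataclass, field
from datetime import datetime, timedelta
from typing import Callable, List, Optional, Tuple


@dataclass
class Job:
    name: str
    action: Callable[[], str]
    next_run: datetime
    interval: Optional[timedelta] = None  # None means a one-time trigger


@dataclass
class Scheduler:
    jobs: List[Job] = field(default_factory=list)
    # Each history entry: (run time, job name, result text).
    history: List[Tuple[datetime, str, str]] = field(default_factory=list)

    def run_due(self, now: datetime) -> None:
        """Run every job whose trigger time has been reached."""
        for job in list(self.jobs):
            if job.next_run > now:
                continue
            try:
                result = job.action()
            except Exception as exc:
                result = f"failed: {exc}"  # failures are kept in the history too
            self.history.append((now, job.name, result))
            if job.interval is None:
                self.jobs.remove(job)          # one-time jobs run only once
            else:
                job.next_run += job.interval   # periodic jobs are rescheduled


start = datetime(2024, 1, 1, 2, 0)
scheduler = Scheduler()
scheduler.jobs.append(Job("backup", lambda: "ok", start, timedelta(days=1)))
scheduler.jobs.append(Job("reindex", lambda: "ok", start))  # one-time

scheduler.run_due(start)                      # both jobs are due
scheduler.run_due(start + timedelta(days=1))  # only the daily backup is due
```

After these two calls, `scheduler.history` holds three entries and only the periodic backup job remains scheduled.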
