Technical Validation of Vertica in Eon Mode and Virtuozzo S3 Storage

This document outlines the technical validation process to confirm compatibility between Vertica in Eon Mode and Virtuozzo Infrastructure S3 storage. It is intended for customers seeking technical details about the integration, assuming a high level of technical knowledge regarding the described technologies. The document provides details about the technologies used, the environment setup, and the validation tests performed.

Technology overview

Vertica

Vertica is a unified analytics data warehouse with a massively scalable architecture. It supports a wide range of analytical functions, including event and time-series analysis, pattern matching, geospatial analysis, and in-database machine learning. Vertica enables organizations to gain predictive business insights faster than other data warehouses by applying these powerful functions to large and demanding workloads. The platform operates across major public clouds and on-premises environments, integrating with cloud object storage and HDFS without requiring data migration.

Vertica offers two deployment modes on a unified platform:

  • Vertica in Enterprise Mode runs on industry-standard servers with tightly coupled storage for consistently high performance in use cases that demand predictable compute capacity.
  • Vertica in Eon Mode has a cloud-native architecture that separates compute from storage, allowing flexible resource management for variable workloads. This architecture enables specific compute resources to be applied to shared storage for different use cases.

In this validation, Vertica in Eon Mode leverages S3 API integration, ensuring compatibility with object storage solutions like Virtuozzo Infrastructure S3.

Virtuozzo Infrastructure S3

Virtuozzo Infrastructure provides scalable, high-performance storage, capable of supporting analytics and machine learning workloads. With support for the widely used S3 API, Virtuozzo’s S3 storage enables real-time analytics and accelerated machine learning training. It serves as a robust foundation for structured and unstructured data lakes with support for hundreds of petabytes (PBs) of data and hundreds of gigabytes (GB/s) of throughput.

The key feature of Virtuozzo Infrastructure for this validation is its S3 API protocol support that ensures full compatibility with Vertica in Eon Mode.

Test environment

Architecture overview & validation scope

The validation process used the following technologies:

  • Vertica in Eon Mode version 24.1 deployed on a 3-node physical server cluster.
  • Virtuozzo Infrastructure S3 version 6.1 running on a 7-node storage cluster.

The architecture for the validation process is illustrated in the diagram below. It depicts the key components used during testing.

architecture overview

Data access during the validation was managed through Virtuozzo’s integrated S3 API, ensuring seamless interaction between Vertica and Virtuozzo Infrastructure S3.

Note: DNS with Round-Robin was used to load balance requests across the Virtuozzo S3 nodes during the tests.

Resources

The table below provides a list of resources and technologies used to support the validation process.

DescriptionOS versionDetails and specifications
Vertica Servers
3 Vertica database nodes
RHEL 9.2CPU Intel(R) Xeon(R) Gold 6226R CPU @
2.90GHz, 16 cores
256GB RAM
Vertica Depot resides on 4TB NVMe
100 G Mellanox Connect X5
Virtuozzo Infrastructure S3 cluster
7 Virtuozzo Infrastructure nodes
Virtuozzo Infrastructure 6.1CPU AMD EPYC 7352 24-Core Processor
128GB RAM
8 x The Ultrastar DC SN640 NVMe 7.68TB
Mellanox ConnectX-5 2x25Gb

Supporting information and considerations

The table below contains information related to the testing software.

DescriptionDetails
Vertica S3 API compatibility suiteDesigned to work with AWS S3 compatible object store.
Vertica S3 benchmark suiteThe S3 benchmark suite is based on TPC-DS. TPC Benchmark DS (TPC- DS) is a decision support benchmark that models several applicable aspects of a decision support system, including queries and data maintenance. A benchmark result measures query response time in single user mode, query throughput in multi-user mode, and data maintenance performance for a given hardware, operating system, and data processing system configuration under a controlled, complex, and multi-user decision support workload.

Virtuozzo Infrastructure S3 configuration

The Virtuozzo Infrastructure workflow includes the infrastructure setup and service provisioning. After the infrastructure setup, you will have a storage cluster with the configured network and the highly available management node. On top of the storage cluster, you can deploy and configure S3 services for provisioning to end users. All these tasks can be performed either in the admin panel or via the vinfra command-line tool. All of the deployment and configuration aspects can be found on the Welcome page of the Administrator Guide.

Next, you can create and manage the Virtuozzo Infrastructure S3 cluster. The process details, prerequisites, and features are described in the Provisioning object storage space section.

Validation tests

Compatibility testing

The following sets of tests are designed to validate Vertica in Eon Mode platform functionality when integrated with Virtuozzo Infrastructure S3 as an on-premises cloud storage appliance.

S3 API compatibility

Test goal: Verify that Virtuozzo Infrastructure S3 is S3 API compatible with Vertica in Eon Mode.

Pass criteria: All of the S3 API tests successfully pass.

Test status: The test was successfully executed.

Note: Per the EON_On_Prem_OS_S3_ObjectStore_TestingDetails_v3.0 we provided Vertica with the console output showing the results of the test.

Vertica Depot ON test with a 5-TB database

Test goal: Run the test_V3.sh script from the Vertica TPC-DS benchmark suite with a 5-TB database, Depot ON, 1 user, 3 users, and 1 repetition.

Pass criteria: The tests pass successfully without errors, and durations are reasonably close to those captured by Vertica in an AWS environment.

Test plan: Run the test_V3.sh script with the following parameters:

  • size=5000
  • users="1 3"
  • repetitions=1
  • dataexists=0
  • depotonforqueries=1

Non-listed parameters use the default values from the Vertica TPC-DS benchmark suite.

Test status: The test was successfully executed.

Note: Per the EON_On_Prem_withS3_ObjectStore_TestingDetails_v3.0 we provided Vertica with the console output showing the results of the test.

Vertica Depot OFF test with a 5-TB database

Test goal: Run the test_V3.sh script from the Vertica TPC-DS benchmark suite with a 5-TB database, Depot OFF, 1 user, 3 users, and 1 repetition.

Pass criteria: The tests pass successfully without errors, and durations are reasonably close to those captured by Vertica in an AWS environment.

Test plan: Run the test_V3.sh script with the following parameters:

  • size=5000
  • users="1 3"
  • repetitions=1
  • dataexists=1
  • depotonforqueries=0

Non-listed parameters use the default value from the Vertica TPC-DS benchmark suite.

Note: During this test, the data was not generated as it had been already generated during the Depot ON test.

Test status: The test was successfully executed.

Note: Per the EON_On_Prem_withS3_ObjectStore_TestingDetails_v3.0 we provided Vertica with the console output showing the results of the test.

Vertica database revive test

Test goal: Revive the database from Virtuozzo Infrastructure S3.

Pass criteria: The depot is successfully revived.

Test plan: Load 10 GB of data to the depot for these tests using the TPC-DS benchmark suite.

Test status: The database was successfully revived.

Note: Per the EON_On_Prem_withS3_ObjectStore_TestingDetails_v3.0 we provided Vertica with the console output showing the results of the test.

Vertica database backup and recovery

Test goal: Back up and recover the Vertica database.

Pass criteria: The Vertica database is successfully backed up and recovered.

Test plan: Back up and restore the database.

Test status: The backup and restore tests were successfully executed.

Note: Per the EON_On_Prem_withS3_ObjectStore_TestingDetails_v3.0 we provided Vertica with the console output showing the results of the test.

Test results

The table below shows the test results for all tasks.

This section tracks the pass or conditional pass of all the tests and short comments, if any, for each test. More verbose comments can be added in the Comments section after the table.

Note: Pass means that the test meets or beats the benchmark requirements. Conditional Pass means that it is mutually agreed that the test could meet or beat the benchmark requirements with correct sizing. If there were any blocking fails during testing, it would postpone to completing this document until resolved, thus there is no entry for Fail.
#Task executionTest result (Pass or Conditional Pass)Comments
1S3 API Compatibility TestPASS
2Performance and Stress Test (5 TB)
Data GenerationPASS
Load DataPASS
3Query Test DEPOT ON
Concurrency 1, Iteration 1PASS
Concurrency 3, Iteration 1PASS
4Query Test DEPOT OFF
Concurrency 1, Iteration 1PASS
Concurrency 3, Iteration 1PASS
5External Table TestPASS