Reference#

Plugins#

Some Atoti features require large additional libraries and might not be useful in every project. To keep the core library as light as possible, these features are packaged into separate plugins that can be installed when needed.

Available plugins#

Data loading#

atoti_jdbc

Code to interact with databases through JDBC.

atoti_kafka

Code to interact with Kafka streams.

Cloud storage#

atoti_aws

Code to load CSV and Parquet files from AWS S3 into Atoti tables.

atoti_azure

Code to load CSV and Parquet files from Azure Blob Storage into Atoti tables.

atoti_gcp

Code to load CSV and Parquet files from Google Cloud Storage into Atoti tables.

These connectors open tens of HTTP connections to the cloud storage service to transfer the data in parallel, then transparently reassemble the blocks directly in memory. They can load up to 300 GB in about 5 minutes, which is roughly 1 GB/s. Several factors affect the overall download speed:

  • Bandwidth of the network interface.

  • Speed of the CPU cores, since HTTPS connections and client-side encryption consume CPU resources.

  • File size: small files download slowly (under 60 MB/s).

  • Type (hot/cold) of the storage: hot storage is faster.

  • Data locality: speed is best when the host running Atoti and the data are in the same cloud region.
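The parallel transfer and in-memory reassembly described above can be illustrated with a minimal sketch. This is not the connectors' actual implementation. `split_ranges`, `parallel_download`, and `fetch_range` are hypothetical names, and `fetch_range` stands in for an HTTP range request by slicing an in-memory byte string so that the example runs offline.

```python
from __future__ import annotations

from concurrent.futures import ThreadPoolExecutor


def split_ranges(total_size: int, block_size: int) -> list[tuple[int, int]]:
    """Return (start, end) byte offsets, end exclusive, covering total_size."""
    return [
        (start, min(start + block_size, total_size))
        for start in range(0, total_size, block_size)
    ]


def parallel_download(fetch_range, total_size: int, block_size: int, connections: int) -> bytes:
    """Fetch every block concurrently and reassemble the blocks in memory."""
    ranges = split_ranges(total_size, block_size)
    with ThreadPoolExecutor(max_workers=connections) as pool:
        # Executor.map yields results in submission order, so the blocks
        # come back already sorted by offset even if they finish out of order.
        blocks = pool.map(lambda r: fetch_range(*r), ranges)
        return b"".join(blocks)


# Hypothetical stand-in for a remote object in cloud storage.
REMOTE_OBJECT = bytes(range(256)) * 40  # 10,240 bytes


def fetch_range(start: int, end: int) -> bytes:
    """Assumption: simulates an HTTP range request against REMOTE_OBJECT."""
    return REMOTE_OBJECT[start:end]


data = parallel_download(fetch_range, len(REMOTE_OBJECT), block_size=1024, connections=16)
```

With many small files, each transfer has too few blocks to keep all connections busy, which is one plausible reason small files see lower throughput.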

DirectQuery#

See how to Use DirectQuery.

atoti_directquery_bigquery

Code to use DirectQuery on Google BigQuery.

atoti_directquery_clickhouse

Code to use DirectQuery on ClickHouse.

atoti_directquery_databricks

Code to use DirectQuery with Databricks.

atoti_directquery_jdbc

Code to use DirectQuery through JDBC.

atoti_directquery_mssql

Code to use DirectQuery with Microsoft SQL Server.

atoti_directquery_redshift

Code to use DirectQuery on Amazon Redshift.

atoti_directquery_snowflake

Code to use DirectQuery on Snowflake.

atoti_directquery_synapse

Code to use DirectQuery on Azure Synapse Analytics.

Installation#

A plugin can be installed as a Python package or as a Conda package.

For instance, to install the DirectQuery JDBC plugin:

Python package#

poetry add "atoti[directquery-jdbc]"

Or with pip:

pip install "atoti[directquery-jdbc]"

Multiple plugins can be installed with the “extras” syntax:

poetry add "atoti[directquery-jdbc,sql]"
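When the set of plugins is decided programmatically (for example in a setup script), the requirement string can be assembled as in the sketch below. `requirement_with_extras` is a hypothetical helper and not part of Atoti; it only formats the string passed to pip or Poetry.

```python
from __future__ import annotations


def requirement_with_extras(extras: list[str], package: str = "atoti") -> str:
    """Build a requirement such as 'atoti[a,b]' from a list of extras."""
    # dict.fromkeys removes duplicates while preserving the original order.
    unique = list(dict.fromkeys(extras))
    return f"{package}[{','.join(unique)}]" if unique else package


requirement = requirement_with_extras(["directquery-jdbc", "sql"])
print(requirement)  # prints atoti[directquery-jdbc,sql]
```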

Conda package#

conda install atoti atoti-directquery-jdbc
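After installation, one way to check which plugins are available in the current environment is to look up their modules without importing them. The sketch below assumes the importable module names match the plugin names listed on this page; `installed_plugins` is a hypothetical helper, not an Atoti function.

```python
from __future__ import annotations

from importlib.util import find_spec

# Module names taken from the plugin list above (assumed to be importable names).
PLUGIN_MODULES = [
    "atoti_jdbc",
    "atoti_kafka",
    "atoti_aws",
    "atoti_azure",
    "atoti_gcp",
    "atoti_directquery_jdbc",
]


def installed_plugins(modules: list[str] = PLUGIN_MODULES) -> dict[str, bool]:
    """Map each module name to whether it can be found in this environment."""
    # find_spec returns None for a top-level module that is not installed.
    return {name: find_spec(name) is not None for name in modules}


for name, present in installed_plugins().items():
    print(f"{name}: {'installed' if present else 'missing'}")
```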