Databases

The ease of working with Ferris with it's ability to rewire services at will and integrate across clouds and services Agility into engineering.

amazon athena

Amazon Athena is a serverless, interactive query service to query data and analyze big data in Amazon S3 using standard SQL.

amazon redshift

Amazon Redshift is a data warehouse product which forms part of the larger cloud-computing platform Amazon Web Services. It is built on top of technology from the massive parallel processing data warehouse company ParAccel, to handle large scale data sets and database migrations.

apache drill

Apache Drill is an open-source software framework that supports data-intensive distributed applications for interactive analysis of large-scale datasets.

apache druid

Druid is a column-oriented, open-source, distributed data store written in Java. Druid is designed to quickly ingest massive quantities of event data, and provide low-latency queries on top of the data.

apache hive

Apache Hive is a data warehouse software project built on top of Apache Hadoop for providing data query and analysis. Hive gives an SQL-like interface to query data stored in various databases and file systems that integrate with Hadoop.

apache impala

Apache Impala is an open source massively parallel processing SQL query engine for data stored in a computer cluster running Apache Hadoop. Impala has been described as the open-source equivalent of Google F1, which inspired its development in 2012.

apache kylin

Apache Kylin is an open source distributed analytics engine designed to provide a SQL interface and multi-dimensional analysis on Hadoop and Alluxio supporting extremely large datasets. It was originally developed by eBay, and is now a project of the Apache Software Foundation.

apache pinot

Apache Pinot is a column-oriented, open-source, distributed data store written in Java. Pinot is designed to execute OLAP queries with low latency. It is suited in contexts where fast analytics, such as aggregations, are needed on immutable data, possibly, with real-time data ingestion.

apache solr

Solr is an open-source enterprise-search platform, written in Java. Its major features include full-text search, hit highlighting, faceted search, real-time indexing, dynamic clustering, database integration, NoSQL features and rich document handling.

apache spark sql

Spark SQL brings native support for SQL to Spark and streamlines the process of querying data stored both in RDDs (Spark's distributed datasets) and in external sources.

ascend

Experience continuously optimized data pipelines with less code and breakages. Enter the new era of data engineering with Ascend.

clickhouse

ClickHouse is an open-source column-oriented DBMS for online analytical processing that allows users to generate analytical reports using SQL queries in real-time.

cockroachdb

CockroachDB is a distributed database with standard SQL for cloud applications.

cratedb

CrateDB is a distributed SQL database management system that integrates a fully searchable document-oriented data store. It is open-source, written in Java, based on a shared-nothing architecture, and designed for high scalability.

databricks sql

Databricks is an American enterprise software company founded by the creators of Apache Spark. Databricks develops a web-based platform for working with Spark, that provides automated cluster management and IPython-style notebooks.

dremio

Dremio is a high-performance SQL (data) lakehouse platform built on an open data architecture that helps to accelerate BI and Analytics directly on cloud data lake storage.

elasticsearch

Elasticsearch is a search engine based on the Lucene library. It provides a distributed, multitenant-capable full-text search engine with an HTTP web interface and schema-free JSON documents.

Exasol

Exasol is an analytics database management software company. Its product is called Exasol, an in-memory, column-oriented, relational database management system.

firebird

Firebird is an open-source SQL relational database management system that "runs on Linux, Microsoft Windows, macOS and several Unix platforms".

firebolt

Firebolt is a Cloud Data Warehousing solution that helps its users streamline their Data Analytics and access to insights. It offers fast query performance and combines Elasticity, Simplicity, Low cost of the Cloud, and innovation in Analytics.

google bigquery

BigQuery is a fully managed enterprise data warehouse that helps you manage and analyze your data with built-in features like machine learning, geospatial analysis, and business intelligence.

google sheets

Google Sheets is a cloud-based app with advanced capabilities of spreadsheets. It can also be utilized as a database for websites or small applications. Most organizations use it instead of other heavily-priced databases such as PostgreSQL, MySQL, etc., for storing and managing data in real-time

sap hana

SAP HANA in-memory database is for transactional and analytical workloads with any data type — on a single data copy.

hologres

Hologres is a cloud-native Hybrid Serving & Analytical Processing (HSAP) system that is seamlessly integrated with the big data ecosystem.

ibm db2

Db2 is a family of data management products, including database servers, developed by IBM. They initially supported the relational model, but were extended to support object–relational features and non-relational structures like JSON and XML.

ibm netezza performance server

IBM® Netezza® Performance Server for IBM Cloud Pak® for Data is an advanced data warehouse and analytics platform available both on premises and on cloud.

microsoft sql server

Microsoft SQL Server is a relational database management system developed by Microsoft. As a database server, it is a software product with the primary function of storing and retrieving data as requested by other software applications

mysql

MySQL is a relational database management system (RDBMS) developed by Oracle that is based on structured query language (SQL).

oracle

Oracle Database is the first database designed for enterprise grid computing, the most flexible and cost effective way to manage information and applications.

postgresql

PostgreSQL is a powerful, open source object-relational database system with over 30 years of active development that has earned it a strong reputation for reliability.

presto

Presto (or PrestoDB) is an open source, distributed SQL query engine, designed from the ground up for fast analytic queries against data of any size.

rockset

Rockset is a real-time analytics database for serving fast analytics at scale, enabling developers to build modern data apps in record time.

snowflake

Snowflake is a data-warehousing platform. Snowflake provides an enterprise solution that makes the gathering, processing, using big data easy.

teradata

Teradata Vantage is the connected multi-cloud data platform for enterprise analytics that delivers actionable answers and predictive intelligence.

trino

Trino is an ANSI SQL compliant query engine, that works with BI tools such as R, Tableau, Power BI, Feris and many others.

vertica

Vertica provides a best-in-class, unified analytics platform that will forever be independent from underlying infrastructure.

yugabytedb

YugabyteDB is a high-performance transactional distributed SQL database for cloud-native applications

<