Multi-tenant gateway providing serverless SQL on data lakehouses is now an Apache Top-Level Project
Wilmington, DE – January 19, 2023 (GLOBE NEWSWIRE) – The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 open source projects and initiatives, announced today Apache® Kyuubi has graduated and is now a Top-Level Project (TLP).
“We are thrilled to see Kyuubi graduate from incubation up to a Top-Level Project. Since first going open source in 2018, our community has worked hard to get Kyuubi up to this graduation stage. It’s a tribute to their hard work and commitment,” said Willem Ning Jiang, PPMC Member and Incubator. “We look forward to working with the community to further Kyuubi’s tenancy, availability, and workload management.”
Apache Kyuubi is a distributed and multi-tenant gateway providing serverless SQL on data warehouses and data lakehouses. Kyuubi provides a pure SQL gateway through Thrift JDBC/ODBC interface for end-users to manipulate large-scale data with pre-programmed and extensible Spark SQL engines. This minimizes the barriers and costs for end-users to use Spark on the client side. On the server side, the Kyuubi server and engines’ multi-tenant architecture provides administrators a way to achieve computing resource isolation, data security, high availability, and high client concurrency.
Download the latest release of Apache Kyuubi: https://kyuubi.apache.org/releases.html
Kyuubi Overview and Advantages
To achieve the goal of providing serverless SQL on data warehouses and lakehouses, customers need to be able to access data from anywhere, to deploy data at scale, and to connect to any data. Kyuubi’s meets these requirements with:
- Unified Gateway (access data from anywhere):
- The server module plays the role of a unified gateway, offering a single entry point to any cluster. Administrators have a single access point for configuration, security, and control of remote access to clusters, which in turn provides end users with seamless data processing with any Kyuubi engine they need. Various protocols and APIs, including Apache Thrift™, JDBC, ODBC, REST, etc., are supported.
- Distributed Engines (deploy data at scale):
- The engine module plays the role of query runtime, supporting state-of-the-art distributed computing frameworks so end users can process data on single-node machines or clusters, such as YARN and Kubernetes. The engines run under the governance of the server in multi-tenant and multi-application environments to handle different workloads at any scale. Server-side global and continuous optimization can be applied to all applications by administrators, which frees end users from complex performance tuning.
- Diverse Connectors (connect to any data):
- The connectors are part of the plugin system, which allows end users to connect to any data. For basic discovery and exploration, end users can quickly analyze data in the data lake in various formats (Parquet, CSV, JSON, text) from cloud-based storage or on-prem HDFS clusters. For lakehouse formation and analytics, end users can easily build an ACID table storage layer with Hudi, Iceberg, or/and Delta Lake integrated. They can also build a relational abstraction on top of disparate data without exact, transform and load (ETL) jobs, which optimizes data solutions from collecting to connecting.
- Download: https://kyuubi.apache.org/releases.html
- Docs: https://kyuubi.readthedocs.io/en/master/
- GitHub: https://github.com/apache/kyuubi
- Contribute: https://kyuubi.apache.org/become_committer.html
About The Apache Incubator
The Apache Incubator is the primary entry path for projects and codebases wishing to become part of the efforts at The Apache Software Foundation. All code donations from external organizations and existing external projects enter the ASF through the Incubator to: 1) ensure all donations are in accordance with the ASF legal standards; and 2) develop new communities that adhere to our guiding principles. Incubation is required of all newly accepted projects until a further review indicates that the infrastructure, communications, and decision-making process have stabilized in a manner consistent with other successful ASF projects. While incubation status is not necessarily a reflection of the completeness or stability of the code, it does indicate that the project has yet to be fully endorsed by the ASF. For more information, visit https://incubator.apache.org/.
About The Apache Software Foundation (ASF)
Founded in 1999, the Apache Software Foundation exists to provide software for the public good with support from more than 70 sponsors. ASF’s open source software is used ubiquitously around the world with more than 8,400 committers contributing to 320+ active projects, including Apache Superset, Apache Camel, Apache Flink, Apache HTTP Server, Apache Kafka, and Apache Airflow. The Foundation’s open source projects and community practices are considered industry standards, including the widely adopted Apache License 2.0, the podling incubation process, and a consensus-driven decision model that enables projects to build strong communities and thrive. https://apache.org
ASF’s annual ApacheCon event is where open source technologists convene to share best practices and use cases, forge critical relationships, and learn about advancements in their field. https://www.apachecon.com/
© The Apache Software Foundation. “Apache” and “ApacheCon” are registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. All other brands and trademarks are the property of their respective owners.