veloce

module

v0.0.0-...-d4ac72b Latest Latest Go to latest Published: May 15, 2024 License: MIT

Details

Valid go.mod file

The Go module system was introduced in Go 1.11 and is the official dependency management solution for Go.
Redistributable license

Redistributable licenses place minimal restrictions on how software can be used, modified, and redistributed.
Tagged version

Modules with tagged versions give importers more predictable builds.
Stable version

When a project reaches major version v1 it is considered stable.
Learn more about best practices

Repository

github.com/tfmv/veloce

README ¶

🏹 ArrowLake

The Robin Hood of Data Architecture

Alt text

Welcome to the Sherwood Forest of Big Data - ArrowLake! Crafted by the merry data outlaws at Veloce Data Solutions, ArrowLake aims to liberate the data landscape from the clutches of overpriced and cumbersome big data platforms. Much like the legendary Robin Hood, ArrowLake is here to provide a powerful, cost-effective solution for all, championing the cause of efficient and accessible data processing.

🌳 About ArrowLake

In the heart of our data forest, ArrowLake stands as a beacon of innovation, blending the art of Rust and the wisdom of DataFusion. Armed with the prowess of Apache Arrow and Apache Iceberg, this platform is on a quest to surpass the titans of big data realms, but without plundering your coffers!

⚔ Key Features

Apache Arrow Arsenal: Leveraging the in-memory columnar might of Apache Arrow, ensuring swift and efficient data processing.
Rust Strength: Utilizing Rust's performance and safety features to build a robust data processing platform.
BigLake Integration: Seamlessly integrate with Google Cloud's BigLake to enable federated queries across BigQuery and data in GCS.
Apache Iceberg: Utilize Iceberg's powerful table format to manage large analytic datasets on GCS.
Merry Cost-Efficiency: Crafted not for the kings and queens but for the common folk - offering top-tier capabilities without the royal price tag.
Scalable Stronghold: Constructed to grow with your needs, scaling without faltering, just as Robin's band of merry men grew in strength and number.
Open Source Fellowship: A community for all - open, collaborative, and thriving on innovation.

📜 Prerequisites

Equip yourself with Rust - the weapon of choice in our data realm.
Arm yourself with Apache Arrow, Apache DataFusion, and Apache Iceberg libraries.
Embark with a basic map of data processing and analytics territory.
Google Cloud Platform (GCP) account for BigLake and BigQuery integration.

🏰 Architecture

Every inch of ArrowLake's architecture is crafted for resilience, scalability, and efficiency:

Swift Data Ingestion: As fast as Robin's arrows, leveraging Apache Arrow for efficiency.
Mighty Processing Engine: Powered by DataFusion and Rust, ensuring robust and high-performance data processing.
Fortified Storage: Utilizing Apache Iceberg for managing large datasets on GCS.
Federated Query Engine: Enabling seamless querying across BigQuery and Iceberg tables stored in GCS through BigLake.

🤝 Contributing

Join our band of merry contributors! Whether you're a bard singing tales of new features, a blacksmith forging fixes, or a scout spreading the word, your contributions are the lifeblood of ArrowLake. Check out CONTRIBUTING.md for guidelines.

🧭 Roadmap

✔ Initial foray into design and architecture
🚧 Integrating Rust and DataFusion
🔭 Enhancing federated query capabilities
🧑🤝🧑 Rallying the open-source community
📈 Sharpening performance for the battles ahead

📄 License

ArrowLake is bestowed upon the realm under the MIT License. Refer to the LICENSE scroll for more details.

🏹 Author

Thomas F McGeehan V - The Robin Hood of Data!

Directories ¶

Path	Synopsis
ArrowLake
scrap/sandbox
scrap/scratch/dft

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL