The Data & AI Wiki: DataAIWiki.com

Welcome to the Data & AI Wiki, a collaborative, curated repository of knowledge on modern data infrastructure, engineering platforms, open-source table formats, semantic layer integrations, and artificial intelligence concepts.

The purpose of this wiki is to serve as an authoritative, community-driven resource for developers, architects, and practitioners navigating the rapidly evolving landscape of data systems and intelligence applications.

  • Individuals: Learn about prominent contributors, advocates, and authors shaping the data engineering and AI space.
  • Vendor Platforms: In-depth overviews of commercial cloud data platforms, engines, and semantic integration systems.
  • Open Source Tools: Technical breakdowns of open data table formats, data catalog systems, orchestration engines, and vector databases.
  • Guides & Tutorials: Step-by-step walk-throughs covering lakehouse optimization, query federation, and catalog setups.
  • Core Terms & Concepts: Definitions and reference architectures for standard concepts in the modern data stack and AI pipelines.

How to Contribute

This wiki is hosted as a GitHub repository. To propose additions, updates, or corrections, contribute directly to the repository at AlexMercedCoder/dataaiwiki. This site compiles the wiki articles into static pages to provide a fast, indexable, and accessible reading experience.

This page is mirrored from the GitHub Wiki.