Comment on page

Tools

​
  1. 1.
    ​Debezium - an open source distributed platform for change data capture
  2. 2.
    ​Hudi - "Hudi is a rich platform to build streaming data lakes with incremental data pipelines on a self-managing database layer, while being optimized for lake engines and regular batch processing."
  3. 3.
    ​Upsolver - "Continuous SQL Pipelines for Cloud Data Lakes. No custom coding. No orchestration. No infrastructure maintenance."
  4. 4.
    ​DBT - "dbt helps data teams work like software engineers—to ship trusted data, faster. collaboratively deploy analytics code following software engineering best practices like modularity, portability, CI/CD, and documentation. Now anyone who knows SQL can build production-grade data pipelines."
    1. 1.
      ​intro​
    2. 2.
      ​in depth intro​
    3. 3.
      ​dbt in one hour​
    4. 4.
      ​CI/CD with dbt​
  5. 5.
    ​Metorikku - A simplified, lightweight ETL Framework based on Apache Spark
  6. 7.
    ​Stitch - Stitch rapidly moves data from 130+ sources into a data warehouse so you can get to answers faster, no coding required.
  7. 8.
    ​SnowPlow - Generate complete, accurate and well-structured event data across all platforms and channels in a common format, with the Snowplow Behavioral Data Platform.
  8. 9.
    ​Workato - A SINGLE PLATFORM FOR INTEGRATION & WORKFLOW AUTOMATION ACROSS YOUR ORGANIZATION
  9. 10.
    ​AWS Deequ - Test data quality at scale