Data Insights Platform 1.0
> Note: this is a broader epic to track the development of the Data Insights Platform towards a v1.0 implementation running in production. We also carve out subsets of necessary features into quarter-specific sub-epics and track them individually.
>
>> Update: This is now applicable with Data Insights Platform powering Usage Billing on `.com`.

![Overview_v2](/uploads/a91d353f5b65f52752d559705fc9ce4c/Overview_v2.png)

### FY26 Q1

With a [first implementation of the Data Insights Platform](https://gitlab.com/groups/gitlab-org/architecture/gitlab-data-analytics/-/epics/5#note_2303202230) delivered in FY26 Q1, this epic tracks further work to ensure the Platform can scale as needed and be well-integrated within GitLab.

As we learn more & mature our early implementation(s) to ensure scalability & reliability around ingesting, processing & querying necessary data, the following deliverables were/are tracked with their timelines:

### FY26 Q2

* First implementation of data anonymisation/pseudonymisation. ~"workflow::complete"
* Prototype data querying capabilities within the platform. ~"workflow::complete"
* Repeatable testing/staging deployments of the Platform. ~"workflow::complete"
* Performance/scale testing. ~"workflow::complete"
* Sharding incoming data across multiple NATS streams for scalability. ~"workflow::complete"
* Instrumentation to allow monitoring key user journeys. ~"workflow::complete"

### FY26 Q3/Q4

* Onboard and/or empower our first use-case in production - [Usage Billing](https://gitlab.com/groups/gitlab-org/analytics-section/-/epics/14). ~"workflow::complete"
* Complete production readiness for Usage Billing + NATS. ~"workflow::complete"
* Complete production readiness for Usage Billing + Data Insights Platform. ~"workflow::complete"
* Develop Siphon producer-only deployments. ~"workflow::in dev"

### FY27 Q1 - potentially DIP 2.0

* [Use Data Insights Platform to ingest Snowplow data into S3 for consumption from Snowflake](https://gitlab.com/groups/gitlab-org/architecture/gitlab-data-analytics/-/work_items/10).
* Potential integrations with Data Catalog during data enrichment.
* Further develop data querying capabilities within Query API.
* Export Siphon-ingested data in Iceberg format.

### Deferred for now

* Adding support for ingesting `CloudEvents` for general-purpose events.
epic