Build Your Own LinkedIn Analytics Part 12: What’s Next? Open Sourcing and Community
In the 12th and final post of the series, I release the open-source repository that implements the LinkedIn analytics pipelines, and discuss future plans.
This series details the design and implementation of an end-to-end data product for personal LinkedIn analytics, with the goal of demonstrating data engineering principles to aspiring and experienced data professionals alike.
In the 11th and penultimate post of the series, I look back on what has been achieved, what can be done better and what has been learned.
In the 10th post of the series, I show how to set up observability on our data pipeline to monitor its condition and act as necessary.
In the 9th post of the series, I use a combination of Git and Databricks Asset Bundles to make the data pipeline easily deployable and maintainable.
In the 8th post of the series, I convert the scattered pieces of data ingestion, processing and dashboarding into an orchestrated and automated data pipeline.
In the 7th post of the series, I build a dashboard on Databricks for the ingested LinkedIn data.
In the 6th post of the series, I explore the approaches to modelling the LinkedIn data in the gold layer.
In the 5th article in the series, I cover the process of cleaning and transforming the LinkedIn data into a Single Source of Truth (SSOT) in the silver layer.
In the 4th article in the series, I take a deep dive into the process of ingesting data into the bronze layer and highlight relevant industry practices along the way.
In the 3rd article in the series, I examine what LinkedIn data to ingest and where to ingest it from.