What Is Change Data Capture - Understanding Data Engineering 101

Seattle Data Guy
Companies continue to look for methods to gain near-real-time access to their data for analytics. Honestly, this has been ongoing since I got into the data industry a decade ago…and well before that.

One possible choice is a method called change data capture, also known as CDC. I have seen companies employ multiple ways to use CDC or CDC-like approaches to pull data from databases like Postgres or MongoDB.

These approaches range from database triggers to reading the database's own logs, such as the write-ahead log (WAL).
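As a rough sketch of the trigger-based end of that range, the example below records every insert, update, and delete on a table into a separate change table. It uses SQLite from Python's standard library rather than Postgres or MongoDB so it runs without any setup, and the table, trigger, and function names (customers, customers_changes, read_changes) are made up for illustration. A production setup would look different.

```python
import sqlite3


def build_demo_db() -> sqlite3.Connection:
    """Create an in-memory database with a source table and CDC triggers."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE customers (id INTEGER PRIMARY KEY, email TEXT NOT NULL);

        -- Hypothetical change table: one row per captured change.
        CREATE TABLE customers_changes (
            change_id   INTEGER PRIMARY KEY AUTOINCREMENT,
            op          TEXT NOT NULL,
            customer_id INTEGER NOT NULL,
            email       TEXT
        );

        CREATE TRIGGER customers_ai AFTER INSERT ON customers
        BEGIN
            INSERT INTO customers_changes (op, customer_id, email)
            VALUES ('INSERT', NEW.id, NEW.email);
        END;

        CREATE TRIGGER customers_au AFTER UPDATE ON customers
        BEGIN
            INSERT INTO customers_changes (op, customer_id, email)
            VALUES ('UPDATE', NEW.id, NEW.email);
        END;

        CREATE TRIGGER customers_ad AFTER DELETE ON customers
        BEGIN
            INSERT INTO customers_changes (op, customer_id, email)
            VALUES ('DELETE', OLD.id, NULL);
        END;
        """
    )
    return conn


def read_changes(conn: sqlite3.Connection, after_change_id: int = 0) -> list:
    """Return captured changes newer than the given change_id, in order."""
    return conn.execute(
        "SELECT change_id, op, customer_id, email FROM customers_changes "
        "WHERE change_id > ? ORDER BY change_id",
        (after_change_id,),
    ).fetchall()


conn = build_demo_db()
conn.execute("INSERT INTO customers (id, email) VALUES (1, 'a@example.com')")
conn.execute("UPDATE customers SET email = 'b@example.com' WHERE id = 1")
conn.execute("DELETE FROM customers WHERE id = 1")

changes = read_changes(conn)
for change in changes:
    print(change)
```

A downstream job could remember the last change_id it processed and pass it as after_change_id to pull only new changes. The trade-off is that every write to the source table now also writes to the change table, which is one reason log-based approaches are often preferred on busy databases.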

Of course, this video focuses on the analytical side, since many companies use CDC to replace or supplement traditional ETL/ELT.

But CDC can also be a great way to understand your database and its structure. Databases abstract much of what they do to manage and process large volumes of data quickly.

Here are a few articles on CDC if you need to learn more:

How To Implement CDC With Kafka
https://estuary.dev/change-data-captu...

Simple CDC with Debezium + Kafka
Medium: simple-cdc-with-debezium-kafka

0:00 - Intro
1:12 - Write Ahead Logs (WAL)
2:30 - Trigger Based CDC
4:30 - Examples Of Where I Have Used Change Data Capture

If you enjoyed this video, check out some of my other top videos.

Top Courses To Become A Data Engineer In 2022
Top Courses To Become A Data Engineer...

What Is The Modern Data Stack - Intro To Data Infrastructure Part 1
What Is The Modern Data Stack - Intro...

If you would like to learn more about data engineering, then check out Google's GCP certificate
https://bit.ly/3NQVn7V

If you'd like to read up on my updates about the data field, then you can sign up for our newsletter here.

https://seattledataguy.substack.com/

Or check out my blog
https://www.theseattledataguy.com/

And if you want to support the channel, then you can become a paid member of my newsletter
https://seattledataguy.substack.com/s...


Tags: Data engineering projects, Data engineer project ideas, data project sources, data analytics project sources, data project portfolio

_____________________________________________________________
Subscribe: @seattledataguy
_____________________________________________________________
About me:
I have spent my career focused on all forms of data. I have focused on developing algorithms to detect fraud, reduce patient readmission, and redesign insurance provider policy to help reduce the overall cost of healthcare. I have also helped develop analytics for marketing and IT operations in order to optimize limited resources such as employees and budget. I privately consult on data science and engineering problems, both solo and with a company called Acheron Analytics. I have experience both working hands-on with technical problems and helping leadership teams develop strategies to maximize their data.

*I do participate in affiliate programs. If a link has an "*" by it, then I may receive a small portion of the proceeds at no extra cost to you.
Published last year, on 1402/04/01 (Solar Hijri calendar).
Viewed 8,924 times.