Data engineering is a field that can be thought as a superset of business intelligence and data warehousing which brings more elements from software engineering. The reason data engineering exists today is because companies have massive treasure troves of data, but to provide value the data must be extracted. Data engineering provides the toolbox and is how we make sense of that data quickly and effectively.
When it comes to managing data collection, munging and consumption, data pipeline frameworks play a significant role and with the help of Apache Airflow, task of creating data pipeline is not only easy but its actually fun. Originated from AirBnb, Airflow soon became part of the very core of their tech stack.
The data infrastructure ecosystem has yet to show any sign of converging into something more manageable. It seems like we’re still in a huge phase of expansion where every new day bring new distributed database, new frameworks, new libraries and new teammates. As these systems get more complicated and evolve rapidly, it becomes even more important to have something like Apache Airflow that brings everything together in a sane place where every little piece of the puzzle can be orchestrated properly with sane APIs.
Specification: Practical Apache Airflow
|
User Reviews
Be the first to review “Practical Apache Airflow” Cancel reply
This site uses Akismet to reduce spam. Learn how your comment data is processed.
Price | $9.99 |
---|---|
Provider | |
Duration | 3 hours |
Year | 2020 |
Level | All |
Language | English ... |
Certificate | Yes |
Quizzes | No |
There are no reviews yet.