Speaker: Jiaqi Liu
It’s one thing to build a robust data pipeline process in python but a whole other challenge to find tooling and build out the framework that allows for testing a data process. In order to truly iterate and develop a codebase, one has to be able to confidently test during the development process and monitor the production system.
In this talk, I hope to address the key components for building out end to end testing for data pipelines by borrowing concepts from how we test python web services. Just like how we want to check for healthy status codes from our API responses, we want to be able to check that a pipeline is working as expected given the correct inputs. We’ll talk about key features that allows a data pipeline to be easily testable and how to identify timeseries metrics that can be used to monitor the health of a data pipeline.
Slides can be found at: https://speakerdeck.com/pycon2018 and https://github.com/PyCon/2018-slides
It’s one thing to build a robust data pipeline process in python but a whole other challenge to find tooling and build out the framework that allows for testing a data process. In order to truly iterate and develop a codebase, one has to be able to confidently test during the development process and monitor the production system.
In this talk, I hope to address the key components for building out end to end testing for data pipelines by borrowing concepts from how we test python web services. Just like how we want to check for healthy status codes from our API responses, we want to be able to check that a pipeline is working as expected given the correct inputs. We’ll talk about key features that allows a data pipeline to be easily testable and how to identify timeseries metrics that can be used to monitor the health of a data pipeline.
Slides can be found at: https://speakerdeck.com/pycon2018 and https://github.com/PyCon/2018-slides
Jiaqi Liu - Building a Data Pipeline with Testing in Mind - PyCon 2018 tutorial systems | |
45 Likes | 45 Dislikes |
3,541 views views | 17.9K followers |
People & Blogs | Upload TimePublished on 13 May 2018 |
Không có nhận xét nào:
Đăng nhận xét