In this talk we share our experiences of the MLOps platform we build for our customers and how they boost the way data scientists work. We show how their projects of ML models training can go from zero to production in much shorter time, while achieving superior performance, high code quality, training repeatability and governance. Also interesting from the deployment perspective, the platform mixes best-of-breed cloud managed services with a small number of powerful open-source components (e.g. Kedro, MLflow, Seldon) to get extra functionality that data scientists and their ML models need. It should be interesting to anyone from data scientists to platform builders or architects.
- Architecture of a production cloud-managed MLOps platform that is based on lessons learned from real-world projects.
- What data scientists can do on such MLOps platform, how it boosts his work and shortens project’s time to production.
- What open-source tools we use and what extensions we develop to make that happen and fill the gaps
- How this platform can be deployed in the cloud or even on-premise allowing to mix and match managed and open-source components
Krzysztof Zarzycki – CTO | GetInData
Krzysztof is a passionate about technologies hands-on architect and builder of big data solutions. Over the last 12 years he has helped build many big data, stream processing, analytics, and data science solutions for enterprises, large and smaller startups. Playing also a CTO role at GetInData he leads research of next generation platforms around DataOps, MLOps and Stream Processing, defining solution blueprints and the best engineering practices that maximize success rate of GetInData projects.
Marek Wiewiórka – Chief Data Architect | GetInData
Big data architect with 15+ years of experience, DEV/MLOps practitioner with strong background in classic data warehousing solutions and database designs. Helping customers to build modern data analytics platforms by combining containerization with big data technologies. Researcher, Phd student in bioinformatics that tries to design distributed algorithms for analysis of data from next generation sequencing. Cofounder of biodatageeks research group at Warsaw University of Technology. Long-distance runner and biker, hiking lover.
Day 2 | 19th of May – Engineering
Krzysztof Zarzycki – CTO & Marek Wiewiórka – Chief Data Architect | GetInData