PyData: Data Handling Session List
Fundamentals of relational databases
Katharina Rasch
DatabasesSomewhat comfortable with using SQL to access data, but curious to know what happens behind the scenes when you send off your query?
PPML: Machine Learning on Data you cannot see
Valerio Maggio
Neural Networks / Deep Learning, SecurityHave you ever wondered how to train your @PyTorch model on private data you cannot see? If you want to know how, this is the workshop for you! #PPML cc/ @openminedorg
Processing Open Street Map Data with Python and PostgreSQL
Travis Hathaway
Data Engineering, Databases, GIS / Geo-AnalyticsOpen Street Map is a large, community supported data set covering the entire world. Learn how to process this data with Python and PostgreSQL as I walk you through creating projects of your own. Along the way, we learn how OSM data is structured, and how you can use it yourself.
Squirrel - Efficient Data Loading for Large-Scale Deep Learning
Dr. Thomas Wollmann
Distributed Computing, Neural Networks / Deep Learning, Parallel Programming / AsyncLearn why we built and open sourced a data infrastructure library for deep learning.
Using a database in a data science project - Lessons learned in production
Jacopo Farina
Data Engineering, DatabasesLessons learned in 4 years using Postgres in a machine learning project
What are data unit tests and why we need them
Theodore Meynard
Best Practice, Data EngineeringThis talk will introduce the concept of data unit tests and why they are important in the workflow of data scientists when building data products.
Filter