Machine learning applications and operations

 


CAIML #23 took place on 25 January at taod, with food sponsored by how.fm. There were two short talks on machine learning applications and machine learning operations, and plenty of time for social networking with delicious food and drinks. It was nice to meet again face-to-face for exchanges after the Corona break.

Organizer

Speakers

Christopher König, Data Scientist at taod Consulting GmbH:

What the Fuzz? Record Linkage using SPLINK

“We need more data” & “Garbage in – garbage out” are no empty phrases. Merging data from heterogeneous sources, while ensuring data quality are paramount for successful AI / ML applications. Drafting from a real-world example of duplicated customer data, this talk shares hands-on experience on Probabilistic Record Linkage. After introducing the statistical theory of the Fellegi-Sunter (1969) model, we walk you through the most important modeling decisions data scientists face in this context. On the way, you will learn how to implement Record Linkage Models using SPLINK on a PySpark back-end.

Dr. Richard Sieg, Data Scientist at how.fm:

Getting Your Model to Production with AWS Sagemaker

When talking about Machine Learning, we mostly focus on the design & training of models for specific use cases. However, the subsequent step of serving a model in our products is also very crucial and intricate. In this talk, I will show you how to host a model as a scalable & maintainable API using AWS and its ML platform Sagemaker. You can use this API as a template that can be easily adapted to your use case. We will take a closer look at the different layers of the API, its deployment, and how to configure the API for different scenarios & stages. This talk will be quite hands-on and you will see a lot of code – which you do not need to copy since I will provide you with a GitHub repository in the end.

Venue

taod Consulting GmbH, Oskar-Jäger-Straße 173 in Cologne

Cologne AI & Machine Learning Meetup

 

Looking forward to seeing you in spring again

 

Retrospective

Deep Learning Cologne

Applying ML to challenging clinical data

AI for social good

Current state of AI

Automated Data Science