Name: Embracing Noise: How Data Corruption Can Make Models Smarter
Start: 2026-05-04T17:30
End: 2026-05-04T19:30
Location: PyData St. Louis

PyData St. Louis Join us at PyData St. Louis for a talk and community discussion on how noise and imperfect data can actually make machine learning models stronger. Machine learning is often built on the assumption of clean, high-quality data. In reality, data is messy, incomplete, and noisy. This session explores a powerful idea: introducing controlled corruption during training can improve robustness, reduce overfitting, and help models perform better in real-world conditions. Pizza and networking will begin at 5:30 PM, and the talk will start at 6:15 PM. In this session, we’ll cover: • Why real-world data is inherently noisy and imperfect • How data corruption can act as a form of regularization • Different techniques such as Gaussian noise, masking, and label flipping • How corrupted data improves generalization and resilience to distribution shifts • Practical examples in Python from computer vision and NLP The talk is beginner friendly and open to anyone interested in data science, machine learning, or applied AI. After the talk, we’ll leave time for questions, discussion, and networking from 6:50 PM to 7:30 PM with others in the local data community. Pizza will be provided. Who should attend? Anyone curious about data science, machine learning, or building more robust AI systems, including students, professionals, hobbyists, and beginners. Special thanks to Spark Coworking for providing the venue and supporting the local data science community. Come learn, connect, and be part of the PyData St. Louis community! PyData St. Louis is part of the global PyData community. PyData is an educational program of NumFOCUS, a nonprofit organization that promotes open practices in research, data, and scientific computing.

Embracing Noise: How Data Corruption Can Make Models Smarter

Discover More Events Near You