In this hands-on workshop, you'll work through advanced data science techniques, focusing on efficient data manipulation, large-scale transformations, and structured data analysis. Learn to reshape and optimize datasets using Pandas, including advanced DataFrame operations such as melt (unpivoting), stack, unstack, and pivot tables. Work with SQLite for structured data storage and apply vectorized operations for scalable analysis.
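To give a flavor of the reshaping operations mentioned above, here is a minimal sketch of unpivoting a wide table with `melt` and pivoting it back with `pivot_table`. The table, its column names (`name`, `year`, `births`), and the values are invented for illustration and are not taken from the workshop materials.

```python
import pandas as pd

# Hypothetical wide table: one row per name, one column per year.
wide = pd.DataFrame(
    {
        "name": ["Ava", "Liam"],
        "2020": [120, 95],
        "2021": [130, 90],
    }
)

# Unpivot (melt): the year columns become rows in a "tidy" long table.
tidy = wide.melt(id_vars="name", var_name="year", value_name="births")

# Pivot back to wide form. aggfunc="sum" changes nothing here because
# each (name, year) pair occurs exactly once in this toy data.
restored = tidy.pivot_table(
    index="name", columns="year", values="births", aggfunc="sum"
).reset_index()
restored.columns.name = None  # drop the leftover "year" label on the columns

print(tidy)
print(restored)
```

The long form is usually easier to group and filter, while the wide form is easier to read as a report, which is why moving between the two comes up so often.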
Apply advanced data techniques to process and analyze complex datasets
Master DataFrames, Large-Scale Data Processing, and Advanced Data Transformations
Advanced DataFrame Operations – Work with multi-indexing, hierarchical data, and optimized transformations.
Mastering Data Cleaning Techniques – Handle missing values, detect outliers, normalize datasets, and preprocess large datasets.
Unpivoting & Advanced DataFrame Transformations – Learn melt, stack, unstack, and pivot tables to reshape and manipulate complex datasets.
Analyzing Data at Scale – Process large datasets using vectorized operations, memory mapping, and optimized queries with SQLite.
Time-Series Analysis & Forecasting – Detect trends, seasonality, and anomalies using autoregressive models and moving averages.
Hands-On Projects – Apply what you’ve learned to two real-world projects:
Pandas USA.gov Web Traffic Data Analysis
Pandas US Baby Names Data Analysis
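As a rough illustration of the SQLite, vectorized-operation, and moving-average topics listed above, the sketch below stores a small table in an in-memory SQLite database, reads it back through a query, and computes a 3-day moving average without a Python loop. The `visits` table, its columns, the numbers, and the "more than twice the moving average" spike rule are all assumptions made up for this example, not part of the workshop curriculum.

```python
import sqlite3

import pandas as pd

# Hypothetical daily hit counts; an in-memory database keeps the sketch self-contained.
conn = sqlite3.connect(":memory:")
visits = pd.DataFrame(
    {
        "day": ["2024-01-01", "2024-01-02", "2024-01-03",
                "2024-01-04", "2024-01-05", "2024-01-06"],
        "hits": [10, 12, 11, 50, 13, 12],
    }
)
visits.to_sql("visits", conn, index=False)

# Let SQLite do the filtering and ordering so only the needed rows reach pandas.
df = pd.read_sql_query(
    "SELECT day, hits FROM visits WHERE hits > 0 ORDER BY day",
    conn,
    parse_dates=["day"],
)
conn.close()

# Vectorized 3-day moving average (the first two rows are NaN: not enough history).
df["ma3"] = df["hits"].rolling(window=3).mean()

# Naive spike flag with an arbitrary threshold, for illustration only.
df["spike"] = df["hits"] > 2 * df["ma3"]

print(df)
```

With real data the database would live in a file on disk, and the query would typically select only the columns and date range needed, so that the DataFrame stays small enough to fit in memory.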
Taught by award-winning instructors from Carnegie Mellon University
2PM - 5PM
5 Medford Street, Arlington, MA 02474
(Right next to the Regent Theater in Arlington)
Hands-on, In-Person Live Workshop
✅ Live, Hands-On Instruction – Solve exercises and code together in an interactive, in-person setting.
✅ Hands-On Data Exploration – Work through structured exercises using real-world datasets.
✅ Real-World Projects – Work on two practical projects to reinforce learning and application.