Bridging Theory and Practice: ML Solutions for Today’s Challenges
3 days, 20+ experts, and 25+ tech sessions and talks covering critical aspects of:
- Agentic and Generative AI
- Applied Machine Learning in the Real World
- ML Engineering and Optimization
The future of AI is unfolding. Don’t fall behind.
Stay ahead with DataPro, the free weekly newsletter for data scientists, AI/ML researchers, and data engineers.
From trending tools like PyTorch, scikit-learn, XGBoost, and BentoML to hands-on insights on database optimization and real-world ML workflows, you’ll get what matters, fast.
Stay sharp with DataPro. Join 115K+ data professionals who never miss a beat.
Business runs on data. Make sure yours tells the right story.
BIPro is your free weekly newsletter for BI professionals, analysts, and data leaders.
Get practical tips on dashboarding, data visualization, and analytics strategy with tools like Power BI, Tableau, Looker, SQL, and dbt.
Get smarter with BIPro. Trusted by 35K+ BI professionals, see what you’re missing.
This is the code repository for Python Feature Engineering Cookbook, published by Packt.
A complete guide to crafting powerful features for your machine learning models
Python Feature Engineering Cookbook, Third Edition, walks you through tools and methods to craft powerful features from tabular, transactional, and time-series data for robust machine learning models.
This book covers the following exciting features:
- Discover multiple methods to impute missing data effectively
- Encode categorical variables while tackling high cardinality
- Find out how to properly transform, discretize, and scale your variables
- Automate feature extraction from date and time data
- Combine variables strategically to create new and powerful features
- Extract features from transactional data and time series
- Learn methods to extract meaningful features from text data
If you feel this book is for you, get your copy today!
All of the code is organized into folders.
The code will look like the following:
date = "2024-05-17"
rng_hr = pd.date_range(date, periods=20, freq="h")
rng_month = pd.date_range(date, periods=20, freq="ME")
df = pd.DataFrame({"date1": rng_hr, "date2": rng_month})
Following is what you need for this book: If you're a machine learning or data science enthusiast who wants to learn more about feature engineering, data preprocessing, and how to optimize these tasks, this book is for you. If you already know the basics of feature engineering and are looking to learn more advanced methods to craft powerful features, this book will help you. You should have basic knowledge of Python programming and machine learning to get started.
With the following software and hardware list you can run all code files present in the book (Chapter 1-11).
Chapter | Software required | OS required |
---|---|---|
1-11 | Python 3.9 or greater Windows, macOS, or Linux | Windows, Mac OS X, and Linux (Any) |
Soledad Galli is a bestselling data science instructor, author, and open-source Python developer. As the leading instructor at Train in Data, she teaches intermediate and advanced courses in machine learning that have enrolled over 64,000 students worldwide and continue to receive positive reviews. Sole is also the developer and maintainer of the Python open-source library Feature-engine, which provides an extensive array of methods for feature engineering and selection. With extensive experience as a data scientist in finance and insurance sectors, Sole has developed and deployed machine learning models for assessing insurance claims, evaluating credit risk, and preventing fraud. She is a frequent speaker at podcasts, meetups, and webinars, sharing her expertise with the broader data science community.