    Data pre-processing for machine learning in Python (Your Data Teacher Books Book 2)

    Posted By: naag
    English | 2022 | ASIN: B0B5961KTR | 86 pages | PDF | 2.94 MB

    In this book, the author shows how to use the Python programming language for pre-processing tasks in machine learning projects. Pre-processing is the set of transformations applied to a dataset before it can be used to train a machine learning model. It is an important phase of a data science pipeline: poor pre-processing can badly degrade a model's performance, while good pre-processing helps the model learn properly.

    The pre-processing transformations shown in this book are:
    Data cleaning
    Encoding of the categorical variables (one-hot encoding and ordinal encoding)
    Principal Component Analysis
    Scaling (normalization, standardization, robust scaling)
    Binarizing
    Binning
    Power transformations
    Filter-based feature selection
    Oversampling using SMOTE
    All the transformations are described both in theory and in practice, using the Python programming language and the scikit-learn library.
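
To make the scaling item in the list above concrete, here is a minimal sketch of the three variants it names (normalization, standardization, robust scaling). It is not taken from the book: the function names and the sample data are hypothetical, and it uses only the Python standard library rather than scikit-learn, so treat it as an illustration of the arithmetic under those assumptions.

```python
import statistics


def min_max_scale(values):
    """Normalization: map values linearly onto the [0, 1] range."""
    lo, hi = min(values), max(values)
    span = hi - lo
    if span == 0:
        return [0.0 for _ in values]  # constant feature: nothing to scale
    return [(v - lo) / span for v in values]


def standardize(values):
    """Standardization: subtract the mean, divide by the population std."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)
    if std == 0:
        return [0.0 for _ in values]
    return [(v - mean) / std for v in values]


def robust_scale(values):
    """Robust scaling: subtract the median, divide by the interquartile range."""
    median = statistics.median(values)
    # "inclusive" interpolates linearly between the observed data points.
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    if iqr == 0:
        return [0.0 for _ in values]
    return [(v - median) / iqr for v in values]


# Hypothetical feature with one large outlier.
feature = [1, 2, 3, 4, 100]

normalized = min_max_scale(feature)
standardized = standardize(feature)
robust = robust_scale(feature)

print(normalized)
print(standardized)
print(robust)
```

With an outlier such as the last value here, min-max normalization squeezes the remaining points close to 0, whereas robust scaling keeps them spread out because the median and interquartile range are barely affected by a single extreme value.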

    About the author
    Gianluca Malato, born in 1986, is an Italian data scientist, teacher and author. In 2010, he received his Master’s Degree cum laude in Theoretical Physics of disordered systems from “La Sapienza” University of Rome (thesis advisors: Giorgio Parisi and Tommaso Rizzo). He has worked for years as a data architect, project manager, data analyst and data scientist for a large Italian company.

    He is the founder of yourdatateacher.com, an online school where he teaches Data Science, Machine Learning, R, Python and SQL through online courses and individual online training programs.

    He has published several articles about Data Science on his blog, yourdatateacher.com, and in the online publication Towards Data Science (towardsdatascience.com). He received the “Top Writer” mention on Medium.com in the “Artificial Intelligence” category for his articles.