Datasets for the book - Applied Multivariate Statistics with Python

Published: 15 April 2026| Version 1 | DOI: 10.17632/6dngpsj9yc.1
Contributor:
Kenneth Szulczyk

Description

The book, Applied Multivariate Statistics with Python, offers a practical introduction to modern statistical analysis. It combines clear explanations with hands-on applications. It begins with core topics such as hypothesis testing and key probability distributions, then moves into experimental design and ANOVA techniques. Readers learn how to implement these methods using Python, making the material directly applicable. The book also covers multiple regression, including model building and common challenges like multicollinearity and heteroscedasticity. Advanced topics such as MANOVA, discriminant analysis, cluster analysis, and logistic regression are presented in an accessible way. It concludes with machine learning and neural networks for predicting both cross-sectional and time-series data. The datasets used in the book are posted here for easy access.

Files

Steps to reproduce

Instructions are in the book

Institutions

Categories

Machine Learning, Multivariate Analysis, Linear Regression, Logit Regression, Neural Network

Licence