Managing Datasets and Models / / Oswald Campesato.

This book contains a fast-paced introduction to data-related tasks in preparation for training models on datasets. It presents a step-by-step, Python-based code sample that uses the kNN algorithm to manage a model on a dataset. Chapter One begins with an introduction to datasets and issues that can...

Full description

Saved in:
Bibliographic Details
VerfasserIn:
Place / Publishing House:Dulles, VA : : Mercury Learning and Information, , [2023]
©2023
Year of Publication:2023
Language:English
Online Access:
Physical Description:1 online resource (368 p.)
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:This book contains a fast-paced introduction to data-related tasks in preparation for training models on datasets. It presents a step-by-step, Python-based code sample that uses the kNN algorithm to manage a model on a dataset. Chapter One begins with an introduction to datasets and issues that can arise, followed by Chapter Two on outliers and anomaly detection. The next chapter explores ways for handling missing data and invalid data, and Chapter Four demonstrates how to train models with classification algorithms. Chapter 5 introduces visualization toolkits, such as Sweetviz, Skimpy, Matplotlib, and Seaborn, along with some simple Python-based code samples that render charts and graphs. An appendix includes some basics on using awk. Companion files with code, datasets, and figures are available for downloading. Features: Covers extensive topics related to cleaning datasets and working with models Includes Python-based code samples and a separate chapter on Matplotlib and Seaborn Features companion files with source code, datasets, and figures from the book
Format:Mode of access: Internet via World Wide Web.
ISBN:9781683929512
DOI:10.1515/9781683929512
Access:restricted access
Hierarchical level:Monograph
Statement of Responsibility: Oswald Campesato.