Author: Khattab, Abdal Hamid Rabia Mohamed./ Title: Data Cleansing and Additional Processing for Efficient Classification Models /

Search In this Thesis

العنوان

Data Cleansing and Additional Processing for Efficient Classification Models /

المؤلف

Khattab, Abdal Hamid Rabia Mohamed.

هيئة الاعداد

باحث / عبد الحميد ربيع محمد خطاب

مشرف / محمود محمد فهمى

مشرف / محمد عرفة البدرى

مشرف / ندا محمد طه الشناوى

الموضوع

Computer and Control Engineering.

تاريخ النشر

2024.

عدد الصفحات

94 p. :

اللغة

الإنجليزية

الدرجة

الدكتوراه

التخصص

Computational Mechanics

تاريخ الإجازة

16/4/2024

مكان الإجازة

جامعة طنطا - كلية الهندسه - هندسة الحاسبات والتحكم الالى

الفهرس

Only 14 pages are availabe for public view

from

120

from

120

Abstract

The process of data cleansing involves identifying and correcting errors and inconsistencies in raw data to ensure its quality. In machine learning, classification models are used to predict the category or class of new data points based on training data. The accuracy and performance of classification models are heavily reliant on the quality of the training data. Consequently, data cleansing is considered a crucial and indispensable step in the modeling process. The meticulous examination and refinement of the data are essential to ensure its integrity, consistency, and relevance. By undertaking this imperative procedure, researchers and academics can enhance the reliability and validity of their classification models, thereby enabling more accurate predictions and insightful analysis. Data cleaning tasks include Outlier detection, removing duplicates, filling in missing values,correcting inaccuracies, and standardizing data formats to guarantee that the training data is complete, consistent, and accurate. Not only does data cleaning improve data quality, but it can also decrease the dimensionality of the data, resulting in a simpler and more efficient classification model