Divide2Conquer (D2C): a comprehensive study on decentralized overfitting remediation in deep learning

Siddiqui, Md. Saiful Bari

dc.contributor.advisor	Alam, Md. Golam Rabiul
dc.contributor.author	Siddiqui, Md. Saiful Bari
dc.date.accessioned	2025-02-05T08:36:30Z
dc.date.available	2025-02-05T08:36:30Z
dc.date.copyright	©2024
dc.date.issued	2024-10
dc.identifier.other	ID 22166054
dc.identifier.uri	http://hdl.handle.net/10361/25324
dc.description	This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2024.	en_US
dc.description	Cataloged from PDF version of thesis.
dc.description	Includes bibliographical references (pages 59-62).
dc.description.abstract	Overfitting remains a persistent challenge in deep learning, primarily attributed to data outliers, noise, and limited training set sizes. This thesis presents Divide2Conquer (D2C), a novel technique designed to address this issue. D2C proposes partitioning the training data into multiple subsets and training separate identical models on them. To avoid overfitting on any specific subset, the trained parameters from these models are aggregated and averaged periodically throughout the training phase, enabling the model to learn from the entire dataset while mitigating the impact of individual outliers or noise. Empirical evaluations on multiple benchmark datasets across various deep learning tasks from different domains demonstrate that D2C effectively improves generalization performance, particularly for larger datasets. This study verifies D2C’s ability to achieve significant performance gains as a standalone technique and also when used in conjunction with other overfitting reduction methods through a series of experiments, including analysis of decision boundaries, loss curves, and other performance metrics. Additionally, we provide a rigorous mathematical justification for our hypothesis and analyze the applicability of the D2C method through extensive experimentation on various datasets covering multiple domains. We also delve into the trade-offs associated with D2C and explore strategies to mitigate these challenges, providing a comprehensive understanding of D2C’s strengths and weaknesses.	en_US
dc.description.statementofresponsibility	Md. Saiful Bari Siddiqui
dc.format.extent	80 pages
dc.language.iso	en	en_US
dc.publisher	BRAC University	en_US
dc.rights	BRAC University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission.
dc.subject	Deep learning	en_US
dc.subject	Divide2Conquer	en_US
dc.subject	D2C	en_US
dc.subject	Hyperparameter	en_US
dc.subject.lcsh	Deep learning (Machine learning).
dc.title	Divide2Conquer (D2C): a comprehensive study on decentralized overfitting remediation in deep learning	en_US
dc.type	Thesis	en_US
dc.contributor.department	Department of Computer Science and Engineering, BRAC University
dc.description.degree	B.Sc. in Computer Science and Engineering

Files in this item

Name:: 22166054_CSE.pdf
Size:: 1.146Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Thesis & Report, BSc (Computer Science and Engineering) [1583]

Show simple item record