dc.contributor.advisor | Alam, Md. Golam Rabiul | |
dc.contributor.author | Siddiqui, Md. Saiful Bari | |
dc.date.accessioned | 2025-02-05T08:36:30Z | |
dc.date.available | 2025-02-05T08:36:30Z | |
dc.date.copyright | ©2024 | |
dc.date.issued | 2024-10 | |
dc.identifier.other | ID 22166054 | |
dc.identifier.uri | http://hdl.handle.net/10361/25324 | |
dc.description | This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, 2024. | en_US |
dc.description | Cataloged from PDF version of thesis. | |
dc.description | Includes bibliographical references (pages 59-62). | |
dc.description.abstract | Overfitting remains a persistent challenge in deep learning, primarily attributed to data outliers, noise, and limited training set sizes. This thesis presents Divide2Conquer (D2C), a novel technique designed to address this issue. D2C proposes partitioning the training data into multiple subsets and training separate identical models on them. To avoid overfitting on any specific subset, the trained parameters from these models are aggregated and averaged periodically throughout the training phase, enabling the model to learn from the entire dataset while mitigating the impact of individual outliers or noise. Empirical evaluations on multiple benchmark datasets across various deep learning tasks from different domains demonstrate that D2C effectively improves generalization performance, particularly for larger datasets. This study verifies D2C’s ability to achieve significant performance gains as a standalone technique and also when used in conjunction with other overfitting reduction methods through a series of experiments, including analysis of decision boundaries, loss curves, and other performance metrics. Additionally, we provide a rigorous mathematical justification for our hypothesis and analyze the applicability of the D2C method through extensive experimentation on various datasets covering multiple domains. We also delve into the trade-offs associated with D2C and explore strategies to mitigate these challenges, providing a comprehensive understanding of D2C’s strengths and weaknesses. | en_US |
dc.description.statementofresponsibility | Md. Saiful Bari Siddiqui | |
dc.format.extent | 80 pages | |
dc.language.iso | en | en_US |
dc.publisher | BRAC University | en_US |
dc.rights | BRAC University theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. | |
dc.subject | Deep learning | en_US |
dc.subject | Divide2Conquer | en_US |
dc.subject | D2C | en_US |
dc.subject | Hyperparameter | en_US |
dc.subject.lcsh | Deep learning (Machine learning). | |
dc.title | Divide2Conquer (D2C): a comprehensive study on decentralized overfitting remediation in deep learning | en_US |
dc.type | Thesis | en_US |
dc.contributor.department | Department of Computer Science and Engineering, BRAC University | |
dc.description.degree | B.Sc. in Computer Science and Engineering | |