Search for a command to run...
SMELLDroid is a large-scale dataset designed to support empirical research on software quality and maintainability within the Android ecosystem. The dataset comprises 38,704 Android applications, including 29,201 malware samples and 9,503 benign apps, analyzed using a static analysis pipeline to identify and quantify code smells. SMELLDroid captures seven distinct code smell types, encompassing four object-oriented smells (Blob Class, Complex Class, Long Method, Swiss Army Knife) and three Android-specific smells (Heavy AsyncTask, Heavy Broadcast Receiver, Heavy Service). For each application, quantitative indicators describe the presence and frequency of these smells, enabling systematic measurement and comparison across large app populations. The dataset supports research on code smell prevalence, relationships among smell categories, software quality analysis, and comparative studies of maintainability characteristics between benign and malicious Android applications. If you use this dataset, please cite the associated paper describing the SMELLDroid dataset. @inproceedings{SMELLDroid, author={Joyce Champie, Karim Elish, and Mahmoud Elish}, booktitle={23rd IEEE/ACM International Conference on Mining Software Repositories (MSR)}, title={SMELLDroid: A Dataset for Code Smells in Android Apps}, year={2026}} App and Malware Access Information: The apps and malware samples referenced in this dataset are real-world Android applications. To adhere to research ethics guidelines and comply with data sharing and redistribution policies, we do not directly distribute APK files. Instead, researchers are required to obtain the applications through the official AndroZoo repository and/or Drebin. AndroZoo: https://androzoo.uni.lu/ Drebin: https://drebin.mlsec.org/ Usage Instructions: Request access to the repository by following the instructions provided on the official website. Use the cryptographic hash values (e.g., SHA-256) included in our dataset to retrieve the corresponding application samples from the repository.