Characterizing and Understanding Software Security Vulnerabilities in Machine Learning Libraries

Nima Shiri Harzevili; Jiho Shin; Junjie Wang; Song Wang; Nachiappan Nagappan

doi:10.1109/MSR59073.2023.00018

2023 IEEE/ACM 20th International Conference on Mining Software Repositories (MSR)

Characterizing and Understanding Software Security Vulnerabilities in Machine Learning Libraries

Year: 2023, Pages: 27-38

DOI Bookmark: 10.1109/MSR59073.2023.00018

Authors

Nima Shiri Harzevili, York University,Lassonde School of Engineering,Toronto,Canada
Jiho Shin, York University,Lassonde School of Engineering,Toronto,Canada
Junjie Wang, Institute of Software Chinese Academy of Sciences,Beijing,China
Song Wang, York University,Lassonde School of Engineering,Toronto,Canada
Nachiappan Nagappan, IIIT Delhi,New Delhi,India

Abstract

The application of machine learning (ML) libraries has tremendously increased in many domains, including autonomous driving systems, medical, and critical industries. Vulnerabilities of such libraries could result in irreparable consequences. However, the characteristics of software security vulnerabilities have not been well studied. In this paper, to bridge this gap, we take the first step toward characterizing and understanding the security vulnerabilities of seven well-known ML libraries, including TensorFlow, PyTorch, Scikit-learn, Mlpack, Pandas, Numpy, and Scipy. To do so, we collected 683 security vulnerabilities to explore four major factors: 1) vulnerability types, 2) root causes, 3) symptoms, and 4) fixing patterns of security vulnerabilities in the studied ML libraries. The findings of this study can help developers and researchers understand the characteristics of security vulnerabilities across the studied ML libraries.

Like what you’re reading?

Already a member?

Get this article FREE with a new membership!

Assessing the Threat Landscape for Software Libraries
2014 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)
Understanding the Origins of Mobile App Vulnerabilities: A Large-Scale Measurement Study of Free and Paid Apps
2017 IEEE/ACM 14th International Conference on Mining Software Repositories (MSR)
Understanding the Threats of Upstream Vulnerabilities to Downstream Projects in the Maven Ecosystem
2023 IEEE/ACM 45th International Conference on Software Engineering (ICSE)
Compatible Remediation on Vulnerabilities from Third-Party Libraries for Java Projects
2023 IEEE/ACM 45th International Conference on Software Engineering (ICSE)
Automatic Static Vulnerability Detection for Machine Learning Libraries: Are We There Yet?
2023 IEEE 34th International Symposium on Software Reliability Engineering (ISSRE)
Identifying Affected Libraries and Their Ecosystems for Open Source Software Vulnerabilities
2024 IEEE/ACM 46th International Conference on Software Engineering (ICSE)
Identifying Affected Libraries and Their Ecosystems for Open Source Software Vulnerabilities
2024 IEEE/ACM 46th International Conference on Software Engineering (ICSE)
Laughter in the Wild: A Study Into DoS Vulnerabilities in YAML Libraries
2019 18th IEEE International Conference On Trust, Security And Privacy In Computing And Communications/13th IEEE International Conference On Big Data Science And Engineering (TrustCom/BigDataSE)
Toward Automated Exploit Generation for Known Vulnerabilities in Open-Source Libraries
2021 IEEE/ACM 29th International Conference on Program Comprehension (ICPC)
PDGraph: A Large-Scale Empirical Study on Project Dependency of Security Vulnerabilities
2021 51st Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN)

Characterizing and Understanding Software Security Vulnerabilities in Machine Learning Libraries

Authors

Abstract

Related Articles