School of Computer Science & Engineering Faculty Publications

Automated Extraction of Software Names from Vulnerability Reports using LSTM and Expert System

Igor Khokhlov, Sacred Heart UniversityFollow
Ahmet Okutan, Rochester Institute of Technology
Ryan Bryla, Rochester Institute of Technology
Steven Simmons, Rochester Institute of Technology
Mehdi Mirakhorli, Rochester Institute of Technology

Document Type

Conference Proceeding

Publication Date

10-2022

Abstract

Software vulnerabilities are closely monitored by the security community to timely address the security and privacy issues in software systems. Before a vulnerability is published by vulnerability management systems, it needs to be characterized to highlight its unique attributes, including affected software products and versions, to help security professionals prioritize their patches. Associating product names and versions with disclosed vulnerabilities may require a labor-intensive process that may delay their publication and fix, and thereby give attackers more time to exploit them. This work proposes a machine learning method to extract software product names and versions from unstructured CVE descriptions automatically. It uses Word2Vec and Char2Vec models to create context-aware features from CVE descriptions and uses these features to train a Named Entity Recognition (NER) model using bidirectional Long short-term memory (LSTM) networks. Based on the attributes of the product names and versions in previously published CVE descriptions, we created a set of Expert System (ES) rules to refine the predictions of the NER model and improve the performance of the developed method. Experiment results on real-life CVE examples indicate that using the trained NER model and the set of ES rules, software names and versions in unstructured CVE descriptions could be identified with F-Measure values above 0.95.

Comments

2022 IEEE 29th Annual Software Technology Conference (STC),Gaithersburg, MD.

DOI

10.1109/STC55697.2022.00024

Recommended Citation

Khokhlov, I., Okutan, A., Bryla, R., Simmons, S., & Mirakhorli, M. (2022, October). Automated extraction of software names from vulnerability reports using LSTM and expert system [Conference paper]. 2022 IEEE 29th Annual Software Technology Conference (STC) (pp. 125-134). https://doi.org/10.1109/STC55697.2022.00024

Link to Full Text

COinS

School of Computer Science & Engineering Faculty Publications

Automated Extraction of Software Names from Vulnerability Reports using LSTM and Expert System

Document Type

Publication Date

Abstract

Comments

DOI

Recommended Citation

Search

Browse

Author Corner

Links

School of Computer Science & Engineering Faculty Publications

Automated Extraction of Software Names from Vulnerability Reports using LSTM and Expert System

Authors

Document Type

Publication Date

Abstract

Comments

DOI

Recommended Citation

Share

Search

Browse

Author Corner

Links