example
Q. Doesn’t the CPE information contain the product name too? Why extract it from the Description section?
A. CVE-to-CPE mappings are not always in sync; in many cases a CVE’s corresponding CPE information is missing. The Description section, on the other hand, always states what kind of vulnerability exists in which product.
Q. Why NER? Couldn’t we just build a ‘product-name-dictionary’ and do a simple search for the names in the Description?
A. First, vulnerabilities in new products are discovered and registered as new CVEs every day. A simple dictionary search would require someone to manually go through the new CVEs and add each newly registered product to the dictionary. Second, the product name’s position in a sentence varies. Some sentences start with the product name, while others start with the vulnerable version or the name of the product’s vendor.
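To make the first point concrete, here is a minimal sketch of the dictionary approach and where it falls short. The dictionary contents, the `dictionary_lookup` helper, and the three descriptions (including the `ExampleCorp FooServer` product) are all made up for illustration; they are not taken from real CVE entries.

```python
import re

# Hypothetical dictionary of already-known product names (illustrative only).
KNOWN_PRODUCTS = ["Apache Tomcat", "OpenSSL"]


def dictionary_lookup(description, known_products=KNOWN_PRODUCTS):
    """Return the known product names that appear in a description."""
    found = []
    for name in known_products:
        # Case-insensitive match on the whole product name.
        pattern = r"\b" + re.escape(name) + r"\b"
        if re.search(pattern, description, re.IGNORECASE):
            found.append(name)
    return found


# Invented descriptions, written only to mimic the sentence shapes discussed above.
descriptions = [
    # Product name at the start of the sentence.
    "Apache Tomcat 9.0.0 mishandles certain requests, leading to information disclosure.",
    # Vulnerable version first, product name later in the sentence.
    "Versions before 1.1.1 of OpenSSL contain a buffer overflow in certificate parsing.",
    # A product that is not in the dictionary yet.
    "ExampleCorp FooServer 2.3 allows remote attackers to bypass authentication.",
]

for text in descriptions:
    print(dictionary_lookup(text), "<-", text)
```

The lookup handles the first two sentences but returns nothing for the third, because that product was never added to the dictionary. A trained NER model is meant to close this gap by recognizing product names from their context rather than from a fixed list.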
Open Source NLP models
Apache OpenNLP
Stanford NLP
Training Dataset