Parlay
Enriching SBOMs
arbiter
will take a CycloneDX (JSON, XML) or SPDX 2.3 (JSON) document and enrich it with information taken from external services. At present this includes:
By enrich, we mean add additional information. You put in an SBOM, and you get a richer SBOM back. In many cases SBOMs have a minimum of information, often just the name and version of a given package. By enriching that with additional information we can make better decisions about the packages we're using.
Enriching with ecosyste.ms
Let's take a simple CycloneDX SBOM of a Javascript application. Using arbiter
we enrich it using data from ecosyste.ms, adding information about the package license, external links, the maintainer and more.
$ cat testing/sbom.cyclonedx.json
...
{
"bom-ref": "68-subtext@6.0.12",
"type": "library",
"name": "subtext",
"version": "6.0.12",
"purl": "pkg:npm/subtext@6.0.12"
}
...
$ cat testing/sbom.cyclonedx.json | arbiter ecosystems enrich - | jq
...
{
"bom-ref": "68-subtext@6.0.12",
"type": "library",
"supplier": {
"name": "hapi.js",
"url": [
"https://hapi.dev"
]
},
"author": "hapi.js",
"name": "subtext",
"version": "6.0.12",
"description": "HTTP payload parsing",
"licenses": [
{
"expression": "BSD-3-Clause"
}
],
"purl": "pkg:npm/subtext@6.0.12",
"externalReferences": [
{
"url": "https://github.com/hapijs/subtext",
"type": "website"
},
{
"url": "https://www.npmjs.com/package/subtext",
"type": "distribution"
},
{
"url": "https://github.com/hapijs/subtext",
"type": "vcs"
}
],
"properties": [
{
"name": "ecosystems:first_release_published_at",
"value": "2014-09-29T01:56:03Z"
},
{
"name": "ecosystems:latest_release_published_at",
"value": "2019-01-31T19:36:58Z"
}
]
}
...
What about with SPDX? Let's take an SBOM containing a list of packages like so:
{
"name": "concat-map",
"SPDXID": "SPDXRef-7-concat-map-0.0.1",
"versionInfo": "0.0.1",
"downloadLocation": "NOASSERTION",
"copyrightText": "NOASSERTION",
"externalRefs": [
{
"referenceCategory": "PACKAGE-MANAGER",
"referenceType": "purl",
"referenceLocator": "pkg:npm/concat-map@0.0.1"
}
]
}
Running arbiter ecosystems enrich <sbom.spdx.json>
will add additional information:
{
"name": "concat-map",
"SPDXID": "SPDXRef-7-concat-map-0.0.1",
"versionInfo": "0.0.1",
"downloadLocation": "NOASSERTION",
+ "homepage": "https://github.com/ljharb/concat-map",
+ "licenseConcluded": "MIT",
"copyrightText": "NOASSERTION",
+ "description": "concatenative mapdashery",
"externalRefs": [
{
"referenceCategory": "PACKAGE-MANAGER",
"referenceType": "purl",
"referenceLocator": "pkg:npm/concat-map@0.0.1"
}
]
There are a few other utility commands for ecosyste.ms as well. The first returns raw JSON information about a specific package from ecosyste.ms:
arbiter ecosystems package pkg:npm/khulnasoft
You can also return raw JSON information about a specific repository:
arbiter ecosystems repo https://github.com/open-policy-agent/conftest
Enriching with Khulnasoft
arbiter
can also enrich an SBOM with Vulnerability information from Khulnasoft.
It's important to note vulnerability data is moment-in-time information. By adding vulnerability information directly to the SBOM this makes the SBOM moment-in-time too.
Note the Khulnasoft commands require you to be a Khulnasoft customer, and require passing a valid Khulnasoft API token in the KHULNASOFT_TOKEN
environment variable.
arbiter khulnasoft enrich testing/sbom.cyclonedx.json
Khulnasoft will add a new vulnerability attribute to the SBOM, for example:
"vulnerabilities": [
{
"bom-ref": "68-subtext@6.0.12",
"id": "KHULNASOFT-JS-SUBTEXT-467257",
"ratings": [
{
"source": {
"name": "Khulnasoft",
"url": "https://security.khulnasoft.com"
},
"score": 7.5,
"severity": "high",
"method": "CVSSv31",
"vector": "CVSS:3.1/AV:N/AC:L/PR:N/UI:N/S:U/C:N/I:N/A:H"
}
],
"cwes": [
400
],
"description": "Denial of Service (DoS)",
"detail": "...",
"advisories": [
{
"title": "GitHub Commit",
"url": "https://github.com/brave-intl/subtext/commit/9557c115b1384191a0d6e4a9ea028fedf8b44ae6"
},
{
"title": "GitHub Issue",
"url": "https://github.com/hapijs/subtext/issues/72"
},
{
"title": "NPM Security Advisory",
"url": "https://www.npmjs.com/advisories/1168"
}
],
"created": "2019-09-19T10:25:11Z",
"updated": "2020-12-14T14:41:09Z"
}
For SPDX, vulnerability informatio is added as additional externalRefs
:
{
"referenceCategory": "SECURITY",
"referenceType": "advisory",
"referenceLocator": "https://security.khulnasoft.com/vuln/KHULNASOFT-JS-MINIMATCH-3050818",
"comment": "Regular Expression Denial of Service (ReDoS)"
},
{
"referenceCategory": "SECURITY",
"referenceType": "advisory",
"referenceLocator": "https://security.khulnasoft.com/vuln/KHULNASOFT-JS-MINIMATCH-1019388",
"comment": "Regular Expression Denial of Service (ReDoS)"
}
Return raw JSON information about vulnerabilities in a specific package from Khulnasoft:
arbiter khulnasoft package pkg:npm/sqliter@1.0.1
Enriching with OpenSSF Scorecard
The OpenSSF Scorecard project tests various aspects of a projects security posture and provides a score. arbiter
supports added a link to this data with the arbiter scorecard enrich
command.
You can use this like so:
arbiter scorecard enrich testing/sbom2.cyclonedx.json
This will currently add an external reference to the Scorecard API which can be used to retrieve the full scorecard.
{
"bom-ref": "103-org.springframework:spring-webmvc@5.3.3",
"type": "library",
"name": "org.springframework:spring-webmvc",
"version": "5.3.3",
"purl": "pkg:maven/org.springframework/spring-webmvc@5.3.3",
"externalReferences": [
{
"url": "https://api.securityscorecards.dev/projects/github.com/spring-projects/spring-framework",
"comment": "OpenSSF Scorecard",
"type": "other"
}
]
},
We're currently looking at the best way of encoding some of the scorecard data in the SBOM itself as well.
What about enriching with other data sources?
There are lots of other sources of package data, and it would be great to add support for them in arbiter
. Please open issues and PRs with ideas.
Pipes!
arbiter
is a fan of stdin and stdout. You can pipe SBOMs from other tools into arbiter
, and pipe between the separate enrich
commands too.
Maybe you want to enrich an SBOM with both ecosyste.ms and Khulnasoft data:
cat testing/sbom.cyclonedx.json | ./arbiter e enrich - | ./arbiter s enrich - | jq
Maybe you want to take the output from Syft and add vulnerabilitity data?
syft -o cyclonedx-json nginx | arbiter s enrich - | jq
Maybe you want to geneate an SBOM with cdxgen
, enrich that with extra information, and test that with bomber
:
cdxgen -o | arbiter e enrich - | bomber scan --provider khulnasoft -
The ecosyste.ms enrichment adds license information, which Bomber then surfaces:
■ Ecosystems detected: gem
■ Scanning 18 packages for vulnerabilities...
■ Vulnerability Provider: Khulnasoft (https://security.khulnasoft.com)
■ Files Scanned
- (sha256:701770b2317ea8cbd03aa398ecb6a0381c85beaf24d46c45665b53331816e360)
■ Licenses Found: MIT, Apache-2.0, BSD-3-Clause, Ruby
Installation
arbiter
binaries are available from GitHub Releases. Just select the archive for your operating system and architecture. For instance, you could download for macOS ARM machines with the following, substituting {version}
for the latest version number, for instance 0.1.4
.
wget https://github.com/khulnasoft/arbiter/releases/download/v{version}/arbiter_Darwin_arm64.tar.gz
tar -xvf arbiter_Darwin_arm64.tar.gz
Supported package types
The various services used to enrich the SBOM data have data for a subset of purl types:
Ecosystems
apk
cargo
cocoapods
composer
gem
golang
hex
maven
npm
nuget
pypi
Khulnasoft
apk
cargo
cocoapods
composer
deb
gem
golang
hex
maven
npm
nuget
pypi
rpm
swift
OpenSSF Scorecard
apk
cargo
cocoapods
composer
gem
golang
hex
maven
npm
nuget
pypi
Note that Scorecard data is available only for a subset of projects from supported Git repositories. See the Scorecard project for more information.