Checks

When a repository event webhook is received, Observatory performs a series of automated scans. The Endpoint Scanner components of the Observatory system perform the actual scans on given endpoints.

ruok-architecture

Broadly speaking, these checsk can be broken into the following categories.

Check Type	Purpose	Strategy
Remote Repository Checks	Verify compliance with source code repositories on remotes such as GitHub.	GitHub Octokit API
Repository Content Checks	Perform scans on the contents of the repository.	Deep clone the repository and use automated scanning tools (e.g. Gitleaks).
URL Scanning Checks	Perform security and compliance checks against the live instance(s) of the product	Various automated scanning tools that interact with a public URL (e.g. axe-core for accessibility scanning).
Container Image Checks	Gather container image data from GCP.	GCP API

Observatory uses GraphQL as a layer to unify the data model for reporting on ITSG-33 and related compliance requirements. Roughly speaking, Observatory's "scanners" can be thought of as writing many pieces of security information about a given product. Similarly, the same data model exposed by the GraphQL API can be queried to report on the status of various compliance requirements.

GraphQL API

Note our assumption that one repository may deploy services behind multiple URLs and each repository may build more than one OCI image.

The sections below expand on each Check Type in greater detail, and also show the parts of our GraphQL schema that expose these Check Types.

Remote Repository Checks

There are many idioms, best practices, and security requirements for remote source code repositories. Observatory automatically performs a number of these checks using information retrievable from the GitHub Octokit API.

Repository Metadata

The following items are extracted with the GitHub Octokit API during each scan:

Data Example

{
    // ...
  endpoint: {
    url: "https://github.com/PHACDataHub/ruok-service-autochecker"
    kind: "Github"
    owner: "PHACDataHub"
    repo: "ruok-service-autochecker"
    description: "Automatically scans existing PHAC (GCP) services to provide visibility on endpoints and standards." 
    license: "MIT"
    visibility: "public"
    programmingLanguage: ["JavaScript", "Dockerfile", "Makefile", "HTML", "TypeScript", "CSS", "Python"]
  }
  // ...
}

Note - the programming languages are ordered most to least frequently occuring, and the plan is to use this for future language specific checks.

API

Note, this 'check' will need to be further developed in the future. At the moment, it searches a cloned GitHub repository for the presence of a directory containing case-insensitive 'api' anywhere in the string. Though this check collects the check_passes or metadata feilds at the moment it returns only a boolean, as some of these services aren't intended to have APIs.

This field can be used in the future to evaluate a segment of interoperability.

Data Example

{
  // ...
  api: true
  // ...
}

Vulnerability (Dependabot) Alerts Enabled

This checks whether GitHub (Dependabot) vunerability alerts have been enabled. The repository can be configured to have Dependabot alerts sent when your repository uses an insecure dependency. There's no metadata for this check.

Remediation

Follow these directions to configure the alerts for your project repository.

Data Example

{
  // ...
  vunerability_alerts: {
    check_passes: false,
    metadata: null
    }
  }
  // ...
}

Pass Criteria

Dependabot alerts are enabled.

Automated Security Fixes Enabled (Dependabot)

Dependabot can be configured to automatically pull request to resolve dependabot alerts if there's a patch.

Data Example

{
  // ...
  automatedSecurityFixes: {
    checkPasses: false
    metadata: {
      enabled: false
      paused: false
    }
  },
  // ...
}

Configure Dependabot to automaticallly create pull requests.

Pass Criteria

Automated security fixes alerts are enabled.

Branch Protection Enabled

Particular branches (generally master or main) can have protection through rules defined to restrict how code can be pushed to that branch. For example, this could require pull request reviews before merging.

This pulls out all the rules set to true, rrequiredDeploymentEnvironments if non-empty, requiredApprovingReviewCount if greater than 0, and pattern - which is the branch to which the rules are applied.

Remediation

Create a branch protection rule to prevent anyone from pushing to the main branch that is used for deployment.

Data Example

{
  // ...
    branchProtection: {
      checkPasses: true
      metadata:  {
        rules: [
          {
            branch: "main",
            dismissesStaleReviews: true,
            requiresApprovingReviews: true,
            requiresCommitSignatures: true,
            requiresConversationResolution: true,
            requiresLinearHistory: true,
            requiresStatusChecks: true,
            requiresStrictStatusChecks: true,
            restrictsPushes: true,
            requiredApprovingReviewCount: 1
          }
        ]
      }
    }
    // ... 
}

Pass Criteria

At least one branch protection rule has been found.

Repository Content Checks

A number of checks are performed by scanning a deep clone of the repository's contents. The purpose of these checks is to perform scanning on all of the source code, configuration, etc. contained in the repository.

Has `Security.md`

A best practice with any open source code repository is to provide instructions via a Security file at the project root for how security vulnerabilities should be reported (see GitHub's code security documentation for more information). This check verifies whether a repository contains a Security file.

Remediation

Include a file called Security.md at the root of your repository explaining how security vulnerabilities should be reported to the repository owner. For example, there may be an email address where vulnerability reports should be sent, and explicit instructions to not document security vulnerabilities in the repository issues.

Data Example

{
  // ...
  "has_security_md":{
      "check_passes": true,
      "metadata": (empty object)
  }
  // ...
}

Pass Criteria

A SECURITY.md, txt or rtf (non-case-sensitive) file is at the root of the GitHub repository. metadata for this will always be null.

Has `dependabot.yaml`

Dependabot is GitHub's dependency vunerability scanner. It uses the configuration outlined in the dependabot.yml file and usually resides in the .github folder.

Remediation

Add the dependabot.yml. Here's some documentation.

Data Example

{
  // ...
  "has_dependabot.yaml":{
      "check_passes": true,
      "metadata": null
  }
  // ...
}

Pass Criteria

A dependabot.yml or yaml resides in your GitHub repository. metadata will always be null. (Note - it is possible to turn dependabot on through the GitHub GUI - settings, security, but it's best practices to include the yml for full transparency.)

Gitleaks Report - secret scanning

Gitleaks detects if secrets that have been commited at any point in the repository's history.

Remediation

Remove the leak from the commit history.

If the 'leak' detected is not a secret, only a false positive, include a .gitleaksignore file at the root of your repository containing that item.

For preventative protection, consider using 'gitleaks protect' pre-commit or using the gitleaks GitHub Action.

Data Example

{
    // ...
  gitleaks: {
    check_passes: false,
    metadata: {
      leaks_found: true,
      number_of_leaks: 2,
      commits_scanned: 466,
      details: [
        {
            description: 'Private Key',
            file: 'scanners/github-cloned-repo-checks/src/fake-secret',
            start_line: 28,
            end_line: 28,
            start_column: 14,
            end_column: 53
            commit: '29c1850108f543f5eaab26ed052508fa0b45bb7',
            author: '=',
            email: 'my.email@gmail.com',
        },
     // ...
      ]
    }
  }
      // ...
}

Pass Criteria

No secrets have been found in history of repository.

`Hadolint` Dockerfile Linting

Hadolint is a linter for Dockerfiles. This scanner analyzes the Dockerfiles in the source code repository, and flags any best practices rules that have been broken.

Remediation

Follow the guidelines outlined in the results message to update the Dockerfiles. If your team has decided to not follow a particular rule in certain instances, you can clear the warning in this scanner by including an inline ignore tag at the Dockerfile location where you would like to have the rule check by-passed.

Data Example

{
    // ...
   hadolint: {
      check_passes: false
      metadata: [
        {
          dockerfile: "ui/Dockerfile",
          rules_violated: [
            {
                code: "DL1000",
                level: "error",
                line: 41,
                message: "unexpected '#'\nexpecting a new line followed by the next instruction"
            }
          ]
        },
        {
          dockerfile: "scanners/web-endpoint-checks/Dockerfile",
          rules_violated: [
            {
                code: "DL3008",
                level: "warning",
                line: 11,
                message: "Pin versions in apt get install. Instead of `apt-get install <package>` use `apt-get install <package>=<version>`"
            },
        // ...
          ]
        }
      ]
      // ...
    }

Pass Criteria

No linting rules broken for any of the Dockerfiles within the repository.

`Trivy` Repository Vunerability Scanning

Trivy is a security scanner we're using in this case to scan software dependencies against known vunerabilities. It offers a remote Git repository scanner, that works for public repositories. Since we have some private repositories, we're using the filesystem scan on the cloned repository instead.

Remediation

Update the dependencies as indicated if there is a fixed version. Follow the URL for more information on the found vunerability.

Data Example

{
    // ...
    trivy_repo_vulnerability: {
      check_passes: false
      metadata: [
        {
          library: "librarya",
          vulnerability_ID: "CVE-2023-xxxx",
          severity: "MEDIUM",
          installed_version: "41.0.x",
          fixed_version: "41.0.y",
          title: "librarya is a package designed to expose  ...",
          url: "https://avd.aquasec.com/nvd/cve-2023-xxxx"
        },
        {
          library: "librayb",
          vulnerability_ID: "GHSA-v8gr-xxxx-xxxx",
          severity: "LOW",
          installed_version: "41.0.x",
          fixed_version: "41.0.y",
          title: "Vulnerable OpenSSL included in libraryb",
          url: "https://github.com/advisories/GHSA-v8gr-xxxx-xxxx"
        },
        // ...
      ]
    }
  // ...
}

Pass Criteria

No vunerabilities captured by trivy have been found in the repository.

File Size Check

TODO

URL (Service) Scanning Checks

Some products have one or more services exposed through URLs. URL compliance checks perform a series of automated accessibility and security compliance checks using information that can be retrieved via these public URLs.

web-endpoint

Puppeteer is used for an accessibility scan. For the Observatory use case, if the accessibility check is non-applicable, check_passes is set to null,

Data Example (Though this is not complete, it runs through many, many checks.)

  accessibility: [
    {
      url: "https://some-webapp.canada.ca/about",
      areaAlt: {
          check_passes: null,
          metadata: {
              description: "Ensures <area> elements of image maps have alternate text",
              helpUrl: "https://dequeuniversity.com/rules/axe/4.8/area-alt?application=axe-puppeteer"
          }
      },
      ariaBrailleEquivalent: {
          check_passes: "false",
          metadata: {
              description: "Ensure aria-braillelabel and aria-brailleroledescription have a non-braille equivalent",
              helpUrl: "https://dequeuniversity.com/rules/axe/4.8/aria-braille-equivalent?application=axe-puppeteer"
          }
      },
      ariaCommandName: {
          check_passes: null,
          metadata: {
              description: "Ensures every ARIA button, link and menuitem has an accessible name",
              helpUrl: "https://dequeuniversity.com/rules/axe/4.8/aria-command-name?application=axe-puppeteer"
          }
      },
      ariaHiddenFocus: {
          check_passes: "true",
          metadata: {
              description: "Ensures aria-hidden elements are not focusable nor contain focusable elements",
              helpUrl: "https://dequeuniversity.com/rules/axe/4.8/aria-hidden-focus?application=axe-puppeteer"
          }
      },
      ariaMeterName: {
          check_passes: "incomplete",
          metadata: {
              description: "Ensures every ARIA meter node has an accessible name",
              helpUrl: "https://dequeuniversity.com/rules/axe/4.8/aria-meter-name?application=axe-puppeteer"
          }
      }
    }
  ]

Container Image Checks

Any products that build and deploy OCI images perform a series of checks on the built image artifact(s).

Checks

Remote Repository Checks

Repository Metadata

API

Vulnerability (Dependabot) Alerts Enabled

Automated Security Fixes Enabled (Dependabot)

Branch Protection Enabled

Repository Content Checks

Has Security.md

Has dependabot.yaml

Gitleaks Report - secret scanning

Hadolint Dockerfile Linting

Trivy Repository Vunerability Scanning

File Size Check

URL (Service) Scanning Checks

web-endpoint

Container Image Checks

Has `Security.md`

Has `dependabot.yaml`

`Hadolint` Dockerfile Linting

`Trivy` Repository Vunerability Scanning