CodeQL documentation

database analyze

Synopsis

codeql database analyze --format=<format> --output=<output> [--threads=<num>] [--ram=<MB>] <options>... [--] <database> <query|dir|suite>...

Description

Analyze a database, producing meaningful results in the context of the source code.

Run a query suite (or some individual queries) against a CodeQL database, producing results, styled as alerts or paths, in SARIF or another interpreted format.

This command combines the effect of the codeql database run-queries and codeql database interpret-results commands. If you want to run queries whose results don’t meet the requirements for being interpreted as source-code alerts, use codeql database run-queries or codeql query run instead, and then codeql bqrs decode to convert the raw results to a readable notation.

Options

<database>

[Mandatory] Path to the CodeQL database to query.

<query|dir|suite>...

[Mandatory] Queries to execute. Each argument is one of:

  • A .ql file to execute.
  • A directory which will be searched recursively for .ql files.
  • A .qls file that defines a particular set of queries.
  • The basename of a “well-known” .qls file exported by one of the installed QL packs.
--format=<format>

[Mandatory] The format in which to write the results. One of:

csv: Formatted comma-separated values, including columns with both rule and alert metadata.

sarif-latest: Static Analysis Results Interchange Format (SARIF), a JSON-based format for describing static analysis results. This format option uses the most recent supported version (v2.1.0). This option is not suitable for use in automation as it will produce different versions of SARIF between different CodeQL versions.

sarifv1: SARIF v1.0.0.

sarifv2: SARIF v2.0.0 (Committee Specification Draft 1).

sarifv2.1.0: SARIF v2.1.0.

graphtext: A textual format representing a graph. Only compatible with queries with @kind graph.

dgml: Directed Graph Markup Language, an XML-based format for describing graphs. Only compatible with queries with @kind graph.

-o, --output=<output>

[Mandatory] The output path to write results to. For graph formats this should be a directory, where one result will be written per query.

--[no-]rerun

Evaluate even queries that seem to have a BQRS result stored in the database already.

--max-paths=<maxPaths>

The maximum number of paths to produce for each alert with paths. (Default: 4)

--[no-]sarif-add-file-contents

[SARIF v2 formats only] Include the full file contents for all files referenced in at least one result.

--[no-]sarif-add-snippets

[SARIF v2.1.0 and later only] Include code snippets for each location mentioned in the results, with two lines of context before and after the reported location.

--[no-]sarif-multicause-markdown

[SARIF v2.1.0 and later only] For akerts that have multiple causes, include them as a Markdown-formatted itemized list in the output in addition to as a plain string.

--no-group-results

[SARIF formats only] Produce one result per message, rather than one result per unique location.

--csv-location-format=<csvLocationFormat>

The format in which to produce locations in CSV output. One of: uri, line-column, offset-length. (Default: line-column)

Options to control the query evaluator

--[no-]tuple-counting

[Advanced] Include tuple counts for each evaluation step in the query evaluator logs. (This can be useful for performance optimization of complex QL code).

--timeout=<seconds>

[Advanced] Set the timeout length for query evaluation, in seconds.

The timeout feature is intended to catch cases where a complex query would take “forever” to evaluate. It is not an effective way to limit the total amount of time the query evaluation can take. The evaluation will be allowed to continue as long as each separately timed part of the computation completes within the timeout. Currently these separately timed parts are “RA stages” of the optimized query, but that might change in the future.

If no timeout is specified, or is given as 0, no timeout will be set (except for codeql test run where the default timeout is 5 minutes).

-j, --threads=<num>

Use this many threads to evaluate queries.

Defaults to 1. You can pass 0 to use one thread per core on the machine, or -N to leave N cores unused (except still use at least one thread).

--[no-]save-cache

[Advanced] Aggressively write intermediate results to the disk cache. This takes more time and uses (much) more disk space, but may speed up the subsequent execution of similar queries.

--[no-]keep-full-cache

[Advanced] Don’t clean up the disk cache after evaluation completes. This may save time if you’re going to do codeql dataset cleanup or codeql database cleanup afterwards anyway.

--max-disk-cache=<MB>

Set the maximum amount of space the disk cache for intermediate query results can use.

If this size is not configured explicitly, the evaluator will try to use a “reasonable” amount of cache space, based on the size of the dataset and the complexity of the queries. Explicitly setting a higher limit than this default usage will enable additional caching which can speed up later queries.

--min-disk-free=<MB>

[Advanced] Set target amount of free space on file system.

If --max-disk-cache is not given, the evaluator will try hard to curtail disk cache usage if the free space on the file system drops below this value.

--min-disk-free-pct=<pct>

[Advanced] Set target fraction of free space on file system.

If --max-disk-cache is not given, the evaluator will try hard to curtail disk cache usage if the free space on the file system drops below this percentage.

--external=<pred>=<file.csv>

A CSV file that contains rows for external predicate <pred>. Multiple --external options can be supplied.

Options to control RAM usage

-M, --ram=<MB>

Set total amount of RAM the query evaluator should be allowed to use.

Options to control QL compilation

--warnings=<mode>

How to handle warnings from the QL compiler. One of:

hide: Suppress warnings.

show (default): Print warnings but continue with compilation.

error: Treat warnings as errors.

--[no-]fast-compilation

[Advanced] Omit particularly slow optimization steps.

--[no-]local-checking

Only perform initial checks on the part of the QL source that is used.

--no-metadata-verification

Don’t check embedded query metadata in QLDoc comments for validity.

--compilation-cache-size=<MB>

[Advanced] Override the default maximum size for a compilation cache directory.

Options to set up compilation environment

--search-path=<dir>[:<dir>...]

A list of directories under which QL packs may be found. Each directory can either be a QL pack (or bundle of packs containing a .codeqlmanifest.json file at the root) or the immediate parent of one or more such directories.

If the path contains more than directory, their order defines precedence between them: when a pack name that must be resolved is matched in more than one of the directory trees, the one given first wins.

Pointing this at a checkout of the open-source CodeQL repository ought to work when querying one of the languages that live there.

If you have have checked out the CodeQL reposity as a sibling of the unpacked CodeQL toolchain, you don’t need to give this option; such sibling directories will always be searched for QL packs that cannot be found otherwise. (If this default does not work, it is strongly recommended to set up --search-path once and for all in a per-user configuration file).

(Note: On Windows the path separator is ;).

--additional-packs=<dir>[:<dir>...]

If this list of directories is given, they will be searched for packs before the ones in --search-path. The order between these doesn’t matter; it is an error if a pack name is found in two different places through this list.

This is useful if you’re temporarily developing a new version of a pack that also appears in the default path. On the other hand it is not recommended to override this option in a config file; some internal actions will add this option on the fly, overriding any configured value.

(Note: On Windows the path separator is ;).

--library-path=<dir>[:<dir>...]

[Advanced] An optional list of directories that will be added to the raw import search path for QL libraries. This should only be used if you’re using QL libraries that have not been packaged as QL packs.

(Note: On Windows the path separator is ;).

--dbscheme=<file>

[Advanced] Explicitly define which dbscheme queries should be compiled against. This should only be given by callers that are extremely sure what they’re doing.

--compilation-cache=<dir>

[Advanced] Specify an additional directory to use as a compilation cache.

--no-default-compilation-cache

[Advanced] Don’t use compilation caches in standard locations such as in the QL pack containing the query or in the CodeQL toolchain directory.

Common options

-h, --help

Show this help text.

-J=<opt>

[Advanced] Give option to the JVM running the command.

(Beware that options containing spaces will not be handled correctly.)

-v, --verbose

Incrementally increase the number of progress messages printed.

-q, --quiet

Incrementally decrease the number of progress messages printed.

--verbosity=<level>

[Advanced] Explicitly set the verbosity level to one of errors, warnings, progress, progress+, progress++, progress+++. Overrides -v and -q.

--logdir=<dir>

[Advanced] Write detailed logs to one or more files in the given directory, with generated names that include timestamps and the name of the running subcommand.

(To write a log file with a name you have full control over, instead give --log-to-stderr and redirect stderr as desired.)