gcc/doc/analyzer.texi

1.1.1.2  mrg @c Copyright (C) 2019-2022 Free Software Foundation, Inc.
    1.1  mrg @c This is part of the GCC manual.
    1.1  mrg @c For copying conditions, see the file gcc.texi.
    1.1  mrg @c Contributed by David Malcolm <dmalcolm (a] redhat.com>.
    1.1  mrg
    1.1  mrg @node Static Analyzer
    1.1  mrg @chapter Static Analyzer
    1.1  mrg @cindex analyzer
    1.1  mrg @cindex static analysis
    1.1  mrg @cindex static analyzer
    1.1  mrg
    1.1  mrg @menu
    1.1  mrg * Analyzer Internals::       Analyzer Internals
    1.1  mrg * Debugging the Analyzer::   Useful debugging tips
    1.1  mrg @end menu
    1.1  mrg
    1.1  mrg @node Analyzer Internals
    1.1  mrg @section Analyzer Internals
    1.1  mrg @cindex analyzer, internals
    1.1  mrg @cindex static analyzer, internals
    1.1  mrg
    1.1  mrg @subsection Overview
    1.1  mrg
    1.1  mrg The analyzer implementation works on the gimple-SSA representation.
    1.1  mrg (I chose this in the hopes of making it easy to work with LTO to
    1.1  mrg do whole-program analysis).
    1.1  mrg
    1.1  mrg The implementation is read-only: it doesn't attempt to change anything,
    1.1  mrg just emit warnings.
    1.1  mrg
    1.1  mrg The gimple representation can be seen using @option{-fdump-ipa-analyzer}.
1.1.1.2  mrg @quotation Tip
1.1.1.2  mrg If the analyzer ICEs before this is written out, one workaround is to use
1.1.1.2  mrg @option{--param=analyzer-bb-explosion-factor=0} to force the analyzer
1.1.1.2  mrg to bail out after analyzing the first basic block.
1.1.1.2  mrg @end quotation
    1.1  mrg
    1.1  mrg First, we build a @code{supergraph} which combines the callgraph and all
    1.1  mrg of the CFGs into a single directed graph, with both interprocedural and
    1.1  mrg intraprocedural edges.  The nodes and edges in the supergraph are called
    1.1  mrg ``supernodes'' and ``superedges'', and often referred to in code as
    1.1  mrg @code{snodes} and @code{sedges}.  Basic blocks in the CFGs are split at
    1.1  mrg interprocedural calls, so there can be more than one supernode per
    1.1  mrg basic block.  Most statements will be in just one supernode, but a call
    1.1  mrg statement can appear in two supernodes: at the end of one for the call,
    1.1  mrg and again at the start of another for the return.
    1.1  mrg
    1.1  mrg The supergraph can be seen using @option{-fdump-analyzer-supergraph}.
    1.1  mrg
    1.1  mrg We then build an @code{analysis_plan} which walks the callgraph to
    1.1  mrg determine which calls might be suitable for being summarized (rather
    1.1  mrg than fully explored) and thus in what order to explore the functions.
    1.1  mrg
    1.1  mrg Next is the heart of the analyzer: we use a worklist to explore state
    1.1  mrg within the supergraph, building an "exploded graph".
    1.1  mrg Nodes in the exploded graph correspond to <point,@w{ }state> pairs, as in
    1.1  mrg      "Precise Interprocedural Dataflow Analysis via Graph Reachability"
    1.1  mrg      (Thomas Reps, Susan Horwitz and Mooly Sagiv).
    1.1  mrg
    1.1  mrg We reuse nodes for <point, state> pairs we've already seen, and avoid
    1.1  mrg tracking state too closely, so that (hopefully) we rapidly converge
    1.1  mrg on a final exploded graph, and terminate the analysis.  We also bail
    1.1  mrg out if the number of exploded <end-of-basic-block, state> nodes gets
    1.1  mrg larger than a particular multiple of the total number of basic blocks
    1.1  mrg (to ensure termination in the face of pathological state-explosion
    1.1  mrg cases, or bugs).  We also stop exploring a point once we hit a limit
    1.1  mrg of states for that point.
    1.1  mrg
    1.1  mrg We can identify problems directly when processing a <point,@w{ }state>
    1.1  mrg instance.  For example, if we're finding the successors of
    1.1  mrg
    1.1  mrg @smallexample
    1.1  mrg    <point: before-stmt: "free (ptr);",
    1.1  mrg     state: @{"ptr": freed@}>
    1.1  mrg @end smallexample
    1.1  mrg
    1.1  mrg then we can detect a double-free of "ptr".  We can then emit a path
    1.1  mrg to reach the problem by finding the simplest route through the graph.
    1.1  mrg
    1.1  mrg Program points in the analysis are much more fine-grained than in the
    1.1  mrg CFG and supergraph, with points (and thus potentially exploded nodes)
    1.1  mrg for various events, including before individual statements.
    1.1  mrg By default the exploded graph merges multiple consecutive statements
    1.1  mrg in a supernode into one exploded edge to minimize the size of the
    1.1  mrg exploded graph.  This can be suppressed via
    1.1  mrg @option{-fanalyzer-fine-grained}.
    1.1  mrg The fine-grained approach seems to make things simpler and more debuggable
    1.1  mrg that other approaches I tried, in that each point is responsible for one
    1.1  mrg thing.
    1.1  mrg
    1.1  mrg Program points in the analysis also have a "call string" identifying the
    1.1  mrg stack of callsites below them, so that paths in the exploded graph
    1.1  mrg correspond to interprocedurally valid paths: we always return to the
    1.1  mrg correct call site, propagating state information accordingly.
    1.1  mrg We avoid infinite recursion by stopping the analysis if a callsite
    1.1  mrg appears more than @code{analyzer-max-recursion-depth} in a callstring
    1.1  mrg (defaulting to 2).
    1.1  mrg
    1.1  mrg @subsection Graphs
    1.1  mrg
    1.1  mrg Nodes and edges in the exploded graph are called ``exploded nodes'' and
    1.1  mrg ``exploded edges'' and often referred to in the code as
    1.1  mrg @code{enodes} and @code{eedges} (especially when distinguishing them
    1.1  mrg from the @code{snodes} and @code{sedges} in the supergraph).
    1.1  mrg
    1.1  mrg Each graph numbers its nodes, giving unique identifiers - supernodes
    1.1  mrg are referred to throughout dumps in the form @samp{SN': @var{index}} and
    1.1  mrg exploded nodes in the form @samp{EN: @var{index}} (e.g. @samp{SN: 2} and
    1.1  mrg @samp{EN:29}).
    1.1  mrg
    1.1  mrg The supergraph can be seen using @option{-fdump-analyzer-supergraph-graph}.
    1.1  mrg
    1.1  mrg The exploded graph can be seen using @option{-fdump-analyzer-exploded-graph}
    1.1  mrg and other dump options.  Exploded nodes are color-coded in the .dot output
    1.1  mrg based on state-machine states to make it easier to see state changes at
    1.1  mrg a glance.
    1.1  mrg
    1.1  mrg @subsection State Tracking
    1.1  mrg
    1.1  mrg There's a tension between:
    1.1  mrg @itemize @bullet
    1.1  mrg @item
    1.1  mrg precision of analysis in the straight-line case, vs
    1.1  mrg @item
    1.1  mrg exponential blow-up in the face of control flow.
    1.1  mrg @end itemize
    1.1  mrg
    1.1  mrg For example, in general, given this CFG:
    1.1  mrg
    1.1  mrg @smallexample
    1.1  mrg       A
    1.1  mrg      / \
    1.1  mrg     B   C
    1.1  mrg      \ /
    1.1  mrg       D
    1.1  mrg      / \
    1.1  mrg     E   F
    1.1  mrg      \ /
    1.1  mrg       G
    1.1  mrg @end smallexample
    1.1  mrg
    1.1  mrg we want to avoid differences in state-tracking in B and C from
    1.1  mrg leading to blow-up.  If we don't prevent state blowup, we end up
    1.1  mrg with exponential growth of the exploded graph like this:
    1.1  mrg
    1.1  mrg @smallexample
    1.1  mrg
    1.1  mrg            1:A
    1.1  mrg           /   \
    1.1  mrg          /     \
    1.1  mrg         /       \
    1.1  mrg       2:B       3:C
    1.1  mrg        |         |
    1.1  mrg       4:D       5:D        (2 exploded nodes for D)
    1.1  mrg      /   \     /   \
    1.1  mrg    6:E   7:F 8:E   9:F
    1.1  mrg     |     |   |     |
    1.1  mrg    10:G 11:G 12:G  13:G    (4 exploded nodes for G)
    1.1  mrg
    1.1  mrg @end smallexample
    1.1  mrg
    1.1  mrg Similar issues arise with loops.
    1.1  mrg
    1.1  mrg To prevent this, we follow various approaches:
    1.1  mrg
    1.1  mrg @enumerate a
    1.1  mrg @item
    1.1  mrg state pruning: which tries to discard state that won't be relevant
    1.1  mrg later on withing the function.
    1.1  mrg This can be disabled via @option{-fno-analyzer-state-purge}.
    1.1  mrg
    1.1  mrg @item
    1.1  mrg state merging.  We can try to find the commonality between two
    1.1  mrg program_state instances to make a third, simpler program_state.
    1.1  mrg We have two strategies here:
    1.1  mrg
    1.1  mrg   @enumerate
    1.1  mrg   @item
    1.1  mrg      the worklist keeps new nodes for the same program_point together,
    1.1  mrg      and tries to merge them before processing, and thus before they have
    1.1  mrg      successors.  Hence, in the above, the two nodes for D (4 and 5) reach
    1.1  mrg      the front of the worklist together, and we create a node for D with
    1.1  mrg      the merger of the incoming states.
    1.1  mrg
    1.1  mrg   @item
    1.1  mrg      try merging with the state of existing enodes for the program_point
    1.1  mrg      (which may have already been explored).  There will be duplication,
    1.1  mrg      but only one set of duplication; subsequent duplicates are more likely
    1.1  mrg      to hit the cache.  In particular, (hopefully) all merger chains are
    1.1  mrg      finite, and so we guarantee termination.
    1.1  mrg      This is intended to help with loops: we ought to explore the first
    1.1  mrg      iteration, and then have a "subsequent iterations" exploration,
    1.1  mrg      which uses a state merged from that of the first, to be more abstract.
    1.1  mrg   @end enumerate
    1.1  mrg
    1.1  mrg We avoid merging pairs of states that have state-machine differences,
    1.1  mrg as these are the kinds of differences that are likely to be most
    1.1  mrg interesting.  So, for example, given:
    1.1  mrg
    1.1  mrg @smallexample
    1.1  mrg       if (condition)
    1.1  mrg         ptr = malloc (size);
    1.1  mrg       else
    1.1  mrg         ptr = local_buf;
    1.1  mrg
    1.1  mrg       .... do things with 'ptr'
    1.1  mrg
    1.1  mrg       if (condition)
    1.1  mrg         free (ptr);
    1.1  mrg
    1.1  mrg       ...etc
    1.1  mrg @end smallexample
    1.1  mrg
    1.1  mrg then we end up with an exploded graph that looks like this:
    1.1  mrg
    1.1  mrg @smallexample
    1.1  mrg
    1.1  mrg                    if (condition)
    1.1  mrg                      / T      \ F
    1.1  mrg             ---------          ----------
    1.1  mrg            /                             \
    1.1  mrg       ptr = malloc (size)             ptr = local_buf
    1.1  mrg           |                               |
    1.1  mrg       copy of                         copy of
    1.1  mrg         "do things with 'ptr'"          "do things with 'ptr'"
    1.1  mrg       with ptr: heap-allocated        with ptr: stack-allocated
    1.1  mrg           |                               |
    1.1  mrg       if (condition)                  if (condition)
    1.1  mrg           | known to be T                 | known to be F
    1.1  mrg       free (ptr);                         |
    1.1  mrg            \                             /
    1.1  mrg             -----------------------------
    1.1  mrg                          | ('ptr' is pruned, so states can be merged)
    1.1  mrg                         etc
    1.1  mrg
    1.1  mrg @end smallexample
    1.1  mrg
    1.1  mrg where some duplication has occurred, but only for the places where the
    1.1  mrg the different paths are worth exploringly separately.
    1.1  mrg
    1.1  mrg Merging can be disabled via @option{-fno-analyzer-state-merge}.
    1.1  mrg @end enumerate
    1.1  mrg
    1.1  mrg @subsection Region Model
    1.1  mrg
    1.1  mrg Part of the state stored at a @code{exploded_node} is a @code{region_model}.
    1.1  mrg This is an implementation of the region-based ternary model described in
1.1.1.2  mrg @url{https://www.researchgate.net/publication/221430855_A_Memory_Model_for_Static_Analysis_of_C_Programs,
    1.1  mrg "A Memory Model for Static Analysis of C Programs"}
    1.1  mrg (Zhongxing Xu, Ted Kremenek, and Jian Zhang).
    1.1  mrg
    1.1  mrg A @code{region_model} encapsulates a representation of the state of
1.1.1.2  mrg memory, with a @code{store} recording a binding between @code{region}
1.1.1.2  mrg instances, to @code{svalue} instances.  The bindings are organized into
1.1.1.2  mrg clusters, where regions accessible via well-defined pointer arithmetic
1.1.1.2  mrg are in the same cluster.  The representation is graph-like because values
1.1.1.2  mrg can be pointers to regions.  It also stores a constraint_manager,
1.1.1.2  mrg capturing relationships between the values.
    1.1  mrg
    1.1  mrg Because each node in the @code{exploded_graph} has a @code{region_model},
    1.1  mrg and each of the latter is graph-like, the @code{exploded_graph} is in some
    1.1  mrg ways a graph of graphs.
    1.1  mrg
1.1.1.2  mrg Here's an example of printing a @code{program_state}, showing the
1.1.1.2  mrg @code{region_model} within it, along with state for the @code{malloc}
1.1.1.2  mrg state machine.
    1.1  mrg
    1.1  mrg @smallexample
    1.1  mrg (gdb) call debug (*this)
1.1.1.2  mrg rmodel:
1.1.1.2  mrg stack depth: 1
1.1.1.2  mrg   frame (index 0): frame: test@@1
1.1.1.2  mrg clusters within frame: test@@1
1.1.1.2  mrg   cluster for: ptr_3: &HEAP_ALLOCATED_REGION(12)
1.1.1.2  mrg m_called_unknown_fn: FALSE
1.1.1.2  mrg constraint_manager:
    1.1  mrg   equiv classes:
    1.1  mrg   constraints:
1.1.1.2  mrg malloc:
1.1.1.2  mrg   0x2e89590: &HEAP_ALLOCATED_REGION(12): unchecked ('ptr_3')
    1.1  mrg @end smallexample
    1.1  mrg
    1.1  mrg This is the state at the point of returning from @code{calls_malloc} back
    1.1  mrg to @code{test} in the following:
    1.1  mrg
    1.1  mrg @smallexample
    1.1  mrg void *
    1.1  mrg calls_malloc (void)
    1.1  mrg @{
    1.1  mrg   void *result = malloc (1024);
    1.1  mrg   return result;
    1.1  mrg @}
    1.1  mrg
    1.1  mrg void test (void)
    1.1  mrg @{
    1.1  mrg   void *ptr = calls_malloc ();
    1.1  mrg   /* etc.  */
    1.1  mrg @}
    1.1  mrg @end smallexample
    1.1  mrg
1.1.1.2  mrg Within the store, there is the cluster for @code{ptr_3} within the frame
1.1.1.2  mrg for @code{test}, where the whole cluster is bound to a pointer value,
1.1.1.2  mrg pointing at @code{HEAP_ALLOCATED_REGION(12)}.  Additionally, this pointer
1.1.1.2  mrg has the @code{unchecked} state for the @code{malloc} state machine
1.1.1.2  mrg indicating it hasn't yet been checked against NULL since the allocation
1.1.1.2  mrg call.
    1.1  mrg
    1.1  mrg @subsection Analyzer Paths
    1.1  mrg
    1.1  mrg We need to explain to the user what the problem is, and to persuade them
    1.1  mrg that there really is a problem.  Hence having a @code{diagnostic_path}
    1.1  mrg isn't just an incidental detail of the analyzer; it's required.
    1.1  mrg
    1.1  mrg Paths ought to be:
    1.1  mrg @itemize @bullet
    1.1  mrg @item
    1.1  mrg interprocedurally-valid
    1.1  mrg @item
    1.1  mrg feasible
    1.1  mrg @end itemize
    1.1  mrg
    1.1  mrg Without state-merging, all paths in the exploded graph are feasible
1.1.1.2  mrg (in terms of constraints being satisfied).
    1.1  mrg With state-merging, paths in the exploded graph can be infeasible.
    1.1  mrg
    1.1  mrg We collate warnings and only emit them for the simplest path
    1.1  mrg e.g. for a bug in a utility function, with lots of routes to calling it,
    1.1  mrg we only emit the simplest path (which could be intraprocedural, if
1.1.1.2  mrg it can be reproduced without a caller).
1.1.1.2  mrg
1.1.1.2  mrg We thus want to find the shortest feasible path through the exploded
1.1.1.2  mrg graph from the origin to the exploded node at which the diagnostic was
1.1.1.2  mrg saved.  Unfortunately, if we simply find the shortest such path and
1.1.1.2  mrg check if it's feasible we might falsely reject the diagnostic, as there
1.1.1.2  mrg might be a longer path that is feasible.  Examples include the cases
1.1.1.2  mrg where the diagnostic requires us to go at least once around a loop for a
1.1.1.2  mrg later condition to be satisfied, or where for a later condition to be
1.1.1.2  mrg satisfied we need to enter a suite of code that the simpler path skips.
1.1.1.2  mrg
1.1.1.2  mrg We attempt to find the shortest feasible path to each diagnostic by
1.1.1.2  mrg first constructing a ``trimmed graph'' from the exploded graph,
1.1.1.2  mrg containing only those nodes and edges from which there are paths to
1.1.1.2  mrg the target node, and using Dijkstra's algorithm to order the trimmed
1.1.1.2  mrg nodes by minimal distance to the target.
1.1.1.2  mrg
1.1.1.2  mrg We then use a worklist to iteratively build a ``feasible graph''
1.1.1.2  mrg (actually a tree), capturing the pertinent state along each path, in
1.1.1.2  mrg which every path to a ``feasible node'' is feasible by construction,
1.1.1.2  mrg restricting ourselves to the trimmed graph to ensure we stay on target,
1.1.1.2  mrg and ordering the worklist so that the first feasible path we find to the
1.1.1.2  mrg target node is the shortest possible path.  Hence we start by trying the
1.1.1.2  mrg shortest possible path, but if that fails, we explore progressively
1.1.1.2  mrg longer paths, eventually trying iterations through loops.  The
1.1.1.2  mrg exploration is captured in the feasible_graph, which can be dumped as a
1.1.1.2  mrg .dot file via @option{-fdump-analyzer-feasibility} to visualize the
1.1.1.2  mrg exploration.  The indices of the feasible nodes show the order in which
1.1.1.2  mrg they were created.  We effectively explore the tree of feasible paths in
1.1.1.2  mrg order of shortest path until we either find a feasible path to the
1.1.1.2  mrg target node, or hit a limit and give up.
1.1.1.2  mrg
1.1.1.2  mrg This is something of a brute-force approach, but the trimmed graph
1.1.1.2  mrg hopefully keeps the complexity manageable.
1.1.1.2  mrg
1.1.1.2  mrg This algorithm can be disabled (for debugging purposes) via
1.1.1.2  mrg @option{-fno-analyzer-feasibility}, which simply uses the shortest path,
1.1.1.2  mrg and notes if it is infeasible.
1.1.1.2  mrg
1.1.1.2  mrg The above gives us a shortest feasible @code{exploded_path} through the
1.1.1.2  mrg @code{exploded_graph} (a list of @code{exploded_edge *}).  We use this
1.1.1.2  mrg @code{exploded_path} to build a @code{diagnostic_path} (a list of
1.1.1.2  mrg @strong{events} for the diagnostic subsystem) - specifically a
1.1.1.2  mrg @code{checker_path}.
    1.1  mrg
    1.1  mrg Having built the @code{checker_path}, we prune it to try to eliminate
    1.1  mrg events that aren't relevant, to minimize how much the user has to read.
    1.1  mrg
    1.1  mrg After pruning, we notify each event in the path of its ID and record the
    1.1  mrg IDs of interesting events, allowing for events to refer to other events
    1.1  mrg in their descriptions.  The @code{pending_diagnostic} class has various
    1.1  mrg vfuncs to support emitting more precise descriptions, so that e.g.
    1.1  mrg
    1.1  mrg @itemize @bullet
    1.1  mrg @item
    1.1  mrg a deref-of-unchecked-malloc diagnostic might use:
    1.1  mrg @smallexample
    1.1  mrg   returning possibly-NULL pointer to 'make_obj' from 'allocator'
    1.1  mrg @end smallexample
    1.1  mrg for a @code{return_event} to make it clearer how the unchecked value moves
    1.1  mrg from callee back to caller
    1.1  mrg @item
    1.1  mrg a double-free diagnostic might use:
    1.1  mrg @smallexample
    1.1  mrg   second 'free' here; first 'free' was at (3)
    1.1  mrg @end smallexample
    1.1  mrg and a use-after-free might use
    1.1  mrg @smallexample
    1.1  mrg   use after 'free' here; memory was freed at (2)
    1.1  mrg @end smallexample
    1.1  mrg @end itemize
    1.1  mrg
    1.1  mrg At this point we can emit the diagnostic.
    1.1  mrg
    1.1  mrg @subsection Limitations
    1.1  mrg
    1.1  mrg @itemize @bullet
    1.1  mrg @item
    1.1  mrg Only for C so far
    1.1  mrg @item
    1.1  mrg The implementation of call summaries is currently very simplistic.
    1.1  mrg @item
    1.1  mrg Lack of function pointer analysis
    1.1  mrg @item
    1.1  mrg The constraint-handling code assumes reflexivity in some places
    1.1  mrg (that values are equal to themselves), which is not the case for NaN.
    1.1  mrg As a simple workaround, constraints on floating-point values are
    1.1  mrg currently ignored.
    1.1  mrg @item
    1.1  mrg There are various other limitations in the region model (grep for TODO/xfail
    1.1  mrg in the testsuite).
    1.1  mrg @item
    1.1  mrg The constraint_manager's implementation of transitivity is currently too
    1.1  mrg expensive to enable by default and so must be manually enabled via
    1.1  mrg @option{-fanalyzer-transitivity}).
    1.1  mrg @item
    1.1  mrg The checkers are currently hardcoded and don't allow for user extensibility
    1.1  mrg (e.g. adding allocate/release pairs).
    1.1  mrg @item
    1.1  mrg Although the analyzer's test suite has a proof-of-concept test case for
    1.1  mrg LTO, LTO support hasn't had extensive testing.  There are various
    1.1  mrg lang-specific things in the analyzer that assume C rather than LTO.
    1.1  mrg For example, SSA names are printed to the user in ``raw'' form, rather
    1.1  mrg than printing the underlying variable name.
    1.1  mrg @end itemize
    1.1  mrg
    1.1  mrg @node Debugging the Analyzer
    1.1  mrg @section Debugging the Analyzer
    1.1  mrg @cindex analyzer, debugging
    1.1  mrg @cindex static analyzer, debugging
    1.1  mrg
    1.1  mrg @subsection Special Functions for Debugging the Analyzer
    1.1  mrg
    1.1  mrg The analyzer recognizes various special functions by name, for use
    1.1  mrg in debugging the analyzer.  Declarations can be seen in the testsuite
    1.1  mrg in @file{analyzer-decls.h}.  None of these functions are actually
    1.1  mrg implemented.
    1.1  mrg
    1.1  mrg Add:
    1.1  mrg @smallexample
    1.1  mrg   __analyzer_break ();
    1.1  mrg @end smallexample
    1.1  mrg to the source being analyzed to trigger a breakpoint in the analyzer when
    1.1  mrg that source is reached.  By putting a series of these in the source, it's
    1.1  mrg much easier to effectively step through the program state as it's analyzed.
    1.1  mrg
1.1.1.2  mrg The analyzer handles:
1.1.1.2  mrg
1.1.1.2  mrg @smallexample
1.1.1.2  mrg __analyzer_describe (0, expr);
1.1.1.2  mrg @end smallexample
1.1.1.2  mrg
1.1.1.2  mrg by emitting a warning describing the 2nd argument (which can be of any
1.1.1.2  mrg type), at a verbosity level given by the 1st argument.  This is for use when
1.1.1.2  mrg debugging, and may be of use in DejaGnu tests.
1.1.1.2  mrg
    1.1  mrg @smallexample
    1.1  mrg __analyzer_dump ();
    1.1  mrg @end smallexample
    1.1  mrg
    1.1  mrg will dump the copious information about the analyzer's state each time it
    1.1  mrg reaches the call in its traversal of the source.
    1.1  mrg
    1.1  mrg @smallexample
1.1.1.2  mrg extern void __analyzer_dump_capacity (const void *ptr);
1.1.1.2  mrg @end smallexample
1.1.1.2  mrg
1.1.1.2  mrg will emit a warning describing the capacity of the base region of
1.1.1.2  mrg the region pointed to by the 1st argument.
1.1.1.2  mrg
1.1.1.2  mrg @smallexample
1.1.1.2  mrg extern void __analyzer_dump_escaped (void);
1.1.1.2  mrg @end smallexample
1.1.1.2  mrg
1.1.1.2  mrg will emit a warning giving the number of decls that have escaped on this
1.1.1.2  mrg analysis path, followed by a comma-separated list of their names,
1.1.1.2  mrg in alphabetical order.
1.1.1.2  mrg
1.1.1.2  mrg @smallexample
    1.1  mrg __analyzer_dump_path ();
    1.1  mrg @end smallexample
    1.1  mrg
    1.1  mrg will emit a placeholder ``note'' diagnostic with a path to that call site,
    1.1  mrg if the analyzer finds a feasible path to it.
    1.1  mrg
    1.1  mrg The builtin @code{__analyzer_dump_exploded_nodes} will emit a warning
    1.1  mrg after analysis containing information on all of the exploded nodes at that
    1.1  mrg program point:
    1.1  mrg
    1.1  mrg @smallexample
    1.1  mrg   __analyzer_dump_exploded_nodes (0);
    1.1  mrg @end smallexample
    1.1  mrg
    1.1  mrg will output the number of ``processed'' nodes, and the IDs of
    1.1  mrg both ``processed'' and ``merger'' nodes, such as:
    1.1  mrg
    1.1  mrg @smallexample
    1.1  mrg warning: 2 processed enodes: [EN: 56, EN: 58] merger(s): [EN: 54-55, EN: 57, EN: 59]
    1.1  mrg @end smallexample
    1.1  mrg
    1.1  mrg With a non-zero argument
    1.1  mrg
    1.1  mrg @smallexample
    1.1  mrg   __analyzer_dump_exploded_nodes (1);
    1.1  mrg @end smallexample
    1.1  mrg
    1.1  mrg it will also dump all of the states within the ``processed'' nodes.
    1.1  mrg
    1.1  mrg @smallexample
    1.1  mrg    __analyzer_dump_region_model ();
    1.1  mrg @end smallexample
    1.1  mrg will dump the region_model's state to stderr.
    1.1  mrg
    1.1  mrg @smallexample
1.1.1.2  mrg __analyzer_dump_state ("malloc", ptr);
1.1.1.2  mrg @end smallexample
1.1.1.2  mrg
1.1.1.2  mrg will emit a warning describing the state of the 2nd argument
1.1.1.2  mrg (which can be of any type) with respect to the state machine with
1.1.1.2  mrg a name matching the 1st argument (which must be a string literal).
1.1.1.2  mrg This is for use when debugging, and may be of use in DejaGnu tests.
1.1.1.2  mrg
1.1.1.2  mrg @smallexample
    1.1  mrg __analyzer_eval (expr);
    1.1  mrg @end smallexample
    1.1  mrg will emit a warning with text "TRUE", FALSE" or "UNKNOWN" based on the
    1.1  mrg truthfulness of the argument.  This is useful for writing DejaGnu tests.
    1.1  mrg
    1.1  mrg
    1.1  mrg @subsection Other Debugging Techniques
    1.1  mrg
1.1.1.2  mrg The option @option{-fdump-analyzer-json} will dump both the supergraph
1.1.1.2  mrg and the exploded graph in compressed JSON form.
1.1.1.2  mrg
    1.1  mrg One approach when tracking down where a particular bogus state is
    1.1  mrg introduced into the @code{exploded_graph} is to add custom code to
1.1.1.2  mrg @code{program_state::validate}.
    1.1  mrg
1.1.1.2  mrg The debug function @code{region::is_named_decl_p} can be used when debugging,
1.1.1.2  mrg such as for assertions and conditional breakpoints.  For example, when
1.1.1.2  mrg tracking down a bug in handling a decl called @code{yy_buffer_stack}, I
1.1.1.2  mrg temporarily added a:
    1.1  mrg @smallexample
1.1.1.2  mrg   gcc_assert (!m_base_region->is_named_decl_p ("yy_buffer_stack"));
    1.1  mrg @end smallexample
1.1.1.2  mrg to @code{binding_cluster::mark_as_escaped} to trap a point where
1.1.1.2  mrg @code{yy_buffer_stack} was mistakenly being treated as having escaped.