A simple tool to generate a call graph for calls within Windows CMD (batch) files.
The tool is available on PyPI: https://pypi.org/project/cmd-call-graph/
By default, it takes the input file as stdin and outputs the resulting file to stdout, outputting logs and errors to stderr.
Given the following CMD script:
@echo off
call :foo
goto :eof
:bar
echo "in bar"
call :baz
call :baz
:baz
echo "in baz"
call powershell.exe Write-Host "Hello World from PowerShell"
:foo
echo "In foo"
goto :bar
This script would generate the following graph:
If the --show-all-calls
option is enabled, then the following graph would be generated:
If both the --show-all-calls
and --show-node-stats
options are enabled:
Invocation example for Ubuntu Linux and WSL (Windows Subsystem for Linux), assumes
Python and pip
are installed:
$ pip install cmd-call-graph
$ cmd-call-graph < your-file.cmd > your-file-call-graph.dot 2>log
The resulting dot
file can be rendered with any dot
renderer. Example with
graphviz (VIEWER
could be explorer.exe
under Windows):
$ sudo apt install graphviz
$ dot -Tpng your-file-call-graph.dot > your-file-call-graph.png
$ $VIEWER your-file-call-graph.png
Example with PowerShell:
PS C:\> choco install graphviz python3 pip
PS C:\> cmd-call-graph.exe -i your-file.cmd -o your-file-call-graph.dot
PS C:\> dot.exe -Tpng your-file-call-graph.dot -O
PS C:\> explorer.exe your-file-call-graph.dot.png
The script analyzes CMD scripts, and represents each block of text under a given label as a node in the call graph.
Each node always contains the line number where it starts, except if the node is never defined in the code,
which can happen in case of programming errors, dynamic node names (e.g., %command%
) and the eof
pseudo-node.
If a node causes the program to exit, it is marked as terminating
.
If --show-node-stats
is set, extra stats about each node are displayed, if present:
- number of lines of code (
LOC
); - number of external calls.
There are 2 special nodes:
_begin_
is a pseudo-node inserted at the start of each call graph, which represents the start of the script, which is by definition without a label;eof
, which may or may not be a pseudo-node. In CMD,eof
is a special node that is used as target ofgoto
to indicate that the current "subroutine" should terminate, or the whole program should terminate if the call stack is empty.
The eof
node is automatically removed if it's a pseudo-node and it's not reached via call
or nested
connections.
The _begin_
pseudo-node is removed if there is another node starting at line 1.
goto
: if an edge of typegoto
goes fromA
toB
, it means that in the code within the labelA
there is an instruction in the formgoto :B
.call
: if an edge of typecall
goes fromA
toB
, it means that in the code within the labelA
there is an instruction in the formcall :B
.nested
: if an edge of typenested
goes fromA
toB
, it means that in the code within the labelA
ends directly whereB
starts, and there is nogoto
orexit
statement at the end ofA
which would prevent the execution from not going intoB
asA
ends.
Example of a nested
connection:
A:
echo "foo"
echo "bar"
B:
echo "baz"
The above code would lead to a nested
connection between A
and B
.
--show-all-calls
: create one edge for each individualcall
/goto
, rather than just one for each type of connections (which is the default). Leads to a busier but more accurate graph;--show-node-stats
: augments each node in the graph with additional information about the node (i.e., number of lines of code, number of external calls);--nodes-to-hide
: hides the list of nodes passed as a space-separated list after this parameter.-v
or--verbose
: enable debug output, which will be sent to the log file;-l
or--log-file
: name of the log file. If not specified, the standard error file is used;-i
or--input
: name of the input file. If not specified, the standard input file is used;-o
or--output
: name of the output file. If not specified, the standard output file is used.
The graphs are self-explanatory: all information is codified with descriptive labels, and there is no information conveyed only with color or other types of non-text graphical hint.
Colors are used to make the graph easier to follow, but no information is conveyed only with color.
Here is what each color means in the graph:
- Orange:
goto
connection; - Blue:
call
connection; - Teal:
nested
connection; - Light gray: background for terminating nodes
Sometimes legacy code bases may contain old CMD files. This tool allows to generate a visual representation of the internal calls within the script.
This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.microsoft.com.
When you submit a pull request, a CLA-bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., label, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.
This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact [email protected] with any additional questions or comments.
Run unit tests from the project root either with the built-in unittest
module:
python -m unittest discover
Or by using pytests
, which can produce reports both for unit test success and for code coverage, by
using the following invocation:
pip install pytest
pip install pytest-cov
pytest tests --doctest-modules --junitxml=junit/test-results.xml --cov=callgraph --cov-report=xml --cov-report=html