AexPy /eɪkspaɪ/ is Api EXplorer in PYthon for detecting API breaking changes in Python packages. Explore APIs of AexPy releases, and APIs of main branch by AexPy itself.
Note
AexPy is the prototype implementation of the conference paper "AexPy: Detecting API Breaking Changes in Python Packages" in Proceedings of the 33rd IEEE International Symposium on Software Reliability Engineering (ISSRE 2022), Charlotte, North Carolina, USA, October 31 - November 3, 2022.
If you use our approach or results in your work, please cite it according to the citation file.
X. Du and J. Ma, "AexPy: Detecting API Breaking Changes in Python Packages," 2022 IEEE 33rd International Symposium on Software Reliability Engineering (ISSRE), 2022, pp. 470-481, doi: 10.1109/ISSRE55969.2022.00052.
demo.webm.mov
- How AexPy works? Approach Design & Evaluation are in AexPy's conference paper, see also talk & slides.
- How we implement AexPy? Source Code & Implemetation are in AexPy's repository, see also design (zh-cn).
- How to use AexPy? Detailed Document & Data are in AexPy's documents, see also video and online AexPy.
graph LR;
Package-->Version-1;
Package-->Version-2;
Version-1-->Preprocessing-1;
Version-2-->Preprocessing-2;
Preprocessing-1-->Extraction-1;
Preprocessing-2-->Extraction-2;
Extraction-1-->Difference;
Extraction-2-->Difference;
Difference-->Evaluation;
Evaluation-->Breaking-Changes;
AexPy also provides a framework to process Python packages, extract APIs, and detect changes, which is designed for easily reusing and customizing. See the following "Advanced Tools" section and the source code for details.
Take the package generator-oj-problem v0.0.1 and v0.0.2 as an example.
- Save API descriptions to
cache/api1.json
andcache/api2.json
- Output report to
report.txt
# Install AexPy package and tool
pip install aexpy
# Extract APIs from v0.0.1
echo generator-oj-problem@0.0.1 | aexpy extract - api1.json -r
# Extract APIs from v0.0.1
echo generator-oj-problem@0.0.2 | aexpy extract - api2.json -r
# Diff APIs between two versions
aexpy diff api1.json api2.json changes.json
View results on online AexPy.
- generator-oj-problem@0.0.1 Distribution and API
- generator-oj-problem@0.0.2 Distribution and API
- Changes and Report
See also about API Level, Call Graph, and Inheritance Diagram.
- Preprocessing
- Download packages and get source code, or use existing code base.
- Count package file sizes and lines of code.
- Read package metadata and detect top modules.
- Extracting
- Extract APIs from Python packages, including modules, classes, functions, attributes.
- Collect detailed APIs, including parameters, instance attributes.
- Detect API aliases and build call graphs, inheritance diagrams.
- Enrich type information for APIs by static type analyzers.
- Diffing
- Detect API changes after pairing APIs between two versions.
- Grade changes by their severities.
- Reporting
- Generate a human-readable report for API change detection results.
- Framework
- Customize processors and implementation details.
- Process Python packages in AexPy's general pipeline with logging and caching.
- Generate portable data in JSON for API descriptions, changes, and so on.
- Execute processing and view data by AexPy's command-line, with stdin/stdout supported.
We provide the Python package on PyPI. Use pip to install the package.
python -m pip install --upgrade aexpy
aexpy --help
Important
Please ensure your Python interpreter works in UTF-8 mode.
We also provide the Docker image to avoid environment errors.
docker pull stardustdl/aexpy:latest
docker run --rm stardustdl/aexpy:latest --help
# or the image from the main branch
docker pull stardustdl/aexpy:main
Tip
All results produced by AexPy are in JSON format, so you could modify it in any text editor.
Preprocess a distribution for a package release.
AexPy provide four preprocessing modes:
-s
,--src
: (default) Use given distribution information (path to code, package name, modules)-r
,--release
: download and unpack the package wheel and automatically load from dist-info-w
,--wheel
: Unpack existing package wheel file and automatically load from dist-info-d
,--dist
: Automatically load from unpacked wheel, and its dist-info
AexPy will automatically load package name, version, top-level modules, and dependencies from dist-info.
There are also options to specify fields in the distribution:
-p
,--project
: Package name and its version, e.g.project@version
.-m
,--module
: (multiple) Top-level module names.-D
,--depends
: (multiple) Package dependencies.-R
,--requirements
: Packagerequirements.txt
file path, to load dependencies.-P
,--pyversion
: Specify Python version for this distribution, supported Python 3.8+.
Tip
You could modify the generated distribution file in a text editor to change field values.
# download the package wheel and unpack into ./cache
# output the distribution file to ./cache/distribution.json
aexpy preprocess -r -p generator-oj-problem@0.0.1 ./cache ./cache/distribution.json
# or output the distribution file to stdout
aexpy preprocess -r -p generator-oj-problem@0.0.1 ./cache -
# use existing wheel file
aexpy preprocess -w ./cache/generator_oj_problem-0.0.1-py3-none-any.whl ./cache/distribution.json
# use existing unpacked wheel directory, auto load metadata from .dist-info directory
aexpy preprocess -d ./cache/generator_oj_problem-0.0.1-py3-none-any ./cache/distribution.json
# use existing source code directory, given the package's name, version, and top-level modules
aexpy preprocess ./cache/generator_oj_problem-0.0.1-py3-none-any ./cache/distribution.json -p generator-oj-problem@0.0.1 -m generator_oj_problem
View results at AexPy Online.
Extract the API description from a distribution.
AexPy provide four modes for the input distribution file:
-j
,--json
: (default) The file is the JSON file produced by AexPy (preprocess
command)-r
,--release
: The file is a text containing the release ID, e.g.,aexpy@0.1.0
-w
,--wheel
: The file is a wheel, i.e.,.whl
file-s
,--src
: The file is a ZIP file that contains the package code directory- Please ensure the directory is at the root of the ZIP archive
Important
About Dependencies
AexPy would dynamically import the target module to detect all available APIs. So please ensure all dependencies have been installed in the extraction environment, or specify the dependencies
field in the distribution, and AexPy will install them into the extraction environment.
If the wheelFile
field is valid (i.e. the target file exists), AexPy will firstly try to install the wheel and ignore the dependencies
field (used when the wheel installation fails).
Tip
About Environment
AexPy use micromamba as default environment manager. Use AEXPY_ENV_PROVIDER
environment variable to specify conda
, mamba
, or micromamba
.
- Use flag
--no-temp
to let AexPy use the current Python environment (as same as AexPy) as the extraction environment (the default behavior of the installed AexPy package). - Use flag
--temp
to let AexPy create a temporary mamba(conda) environment that matches the distribution's pyverion field (the default behavior of our docker image). - Use option
-e
,--env
to specify an existing mamba(conda) env name as the extraction environment (will ignore the temp flag).
aexpy extract ./cache/distribution.json ./cache/api.json
# or input the distribution file from stdin
# (this feature is also supported in other commands)
aexpy extract - ./cache/api.json
# or output the api description file to stdout
aexpy extract ./cache/distribution.json -
# extract from the target project release
echo aexpy@0.0.1 | aexpy extract - api.json -r
# extract from the wheel file
aexpy extract ./temp/aexpy-0.1.0.whl api.json -w
# extract from the project source code ZIP archive
zip -r - ./project | aexpy extract - api.json -s
# Use a env named demo-env
aexpy extract ./cache/distribution.json - -e demo-env
# Create a temporary env
aexpy extract ./cache/distribution.json - --temp
View results at AexPy Online.
Diff two API descriptions and detect changes.
aexpy diff ./cache/api1.json ./cache/api2.json ./cache/diff.json
If you have both stdin for OLD and NEW, please split two API descriptions by a comma ,
.
echo "," | cat ./api1.json - ./api2.json | aexpy diff - - ./changes.json
View results at AexPy Online.
Generate report from detect changes.
aexpy report ./cache/diff.json ./cache/report.json
View results at AexPy Online.
View produced data.
aexpy view ./cache/distribution1.json
aexpy view ./cache/distribution2.json
aexpy view ./cache/api1.json
aexpy view ./cache/api2.json
aexpy view ./cache/diff.json
aexpy view ./cache/report.json
The docker image keeps the same command-line interface, but always use stdin/stdout for host-container data transferring.
echo generator-oj-problem@0.0.1 | docker run -i aexpy/aexpy extract - - > ./api.json
cat ./api1.json <(echo ",") ./api2.json | docker run -i aexpy/aexpy diff - - - > ./changes.json
Tip
If you want to write processed data to filesystem, not the standard IO, add a volume mapping to /data
for file access.
Since the container runs in non-root user, please use root user to allow the container writing to the mounted directory.
docker run -v $pwd/cache:/data -u root aexpy/aexpy extract /data/distribution.json /data/api.json
The processing may cost time, you can use multiple -v
for verbose logs (which are outputed to stderr).
aexpy -vvv view ./cache/report.json
Add -i
or --interact
to enable interactive mode, every command will create an interactive Python shell after finishing processing. Here are some useful variable you could use in the interactive Python shell.
result
: The produced data objectcontext
: The producing context, useexception
to access the exception if failing to process
aexpy -i view ./cache/report.json
Tip
Feel free to use locals()
and dir()
to explore the interactive environment.
AexPy has four loosely-coupled stages in its pipeline. The adjacent stages transfer data by JSON, defined in models directory. You can easily write your own implementation for every stage, and combine your implementation into the pipeline.