GitHub - obfusk/reproducible-apk-tools: reproducible-apk-tools - scripts to make android apks reproducible (original) (raw)

GitHub Release PyPI Version Python Versions CI AGPLv3+

fix-compresslevel.py,fix-files.py,fix-newlines.py,fix-pg-map-id.py,rm-files.py,sort-apk.py,sort-baseline.py,zipalign.py;

binres.py,diff-zip-meta.py,dump-arsc.py,dump-axml.py,dump-baseline.py,list-compresslevel.py,zipalignment.py,zipinfo.py;

inplace-fix.py.

scripts to make android apks reproducible

reproducible-apk-tools is a collection of scripts (available as subcommands of the repro-apk command) to help make APKs reproducible (e.g. by changing line endings from LF to CRLF), or find out why they are not (e.g. by comparing ZIP file metadata, or dumping baseline.prof files).

fix-compresslevel.py

Recompress with different compression level.

Specify which files to change by providing at least one fnmatch-style pattern, e.g. 'assets/foo/*.bar'.

If two APKs have identical contents but some ZIP entries are compressed with a different compression level, thus making the APKs not bit-by-bit identical, this script may help.

$ fix-compresslevel.py --help usage: fix-compresslevel.py [-h] [-v] INPUT_APK OUTPUT_APK COMPRESSLEVEL PATTERN [PATTERN ...] [...] $ apksigcopier compare signed.apk --unsigned unsigned.apk DOES NOT VERIFY [...] $ fix-compresslevel.py unsigned.apk fixed.apk 6 assets/foo/bar.js fixing 'assets/foo/bar.js'... $ zipalign -f 4 fixed.apk fixed-aligned.apk $ apksigcopier compare signed.apk --unsigned fixed-aligned.apk && echo OK OK

NB: this builds a new ZIP file, preserving most ZIP metadata (and recompressing entries not matching the pattern using the same compression level as in the original APK) but not everything: e.g. copying the existing local header extra fields which contain padding for alignment is not supported by Python'sZipFile, which is why zipalign is usually needed.

fix-files.py

Process ZIP entries using an external command.

Runs the command for each specified file, providing the old file contents as stdin and using stdout as the new file contents.

The provided command is split on whitespace to allow passing arguments (e.g.'foo --bar'), but shell syntax is not supported.

Specify which files to process by providing at least one fnmatch-style pattern, e.g. 'META-INF/services/*'.

$ fix-files.py --help usage: fix-files.py [-h] [-v] [--compresslevel PATTERN:LEVELS] INPUT_APK OUTPUT_APK COMMAND PATTERN [PATTERN ...] [...] $ apksigcopier compare signed.apk --unsigned unsigned.apk DOES NOT VERIFY [...] $ fix-files.py unsigned.apk fixed.apk unix2dos 'META-INF/services/*' processing 'META-INF/services/foo' with 'unix2dos'... processing 'META-INF/services/bar' with 'unix2dos'... $ zipalign -f 4 fixed.apk fixed-aligned.apk $ apksigcopier compare signed.apk --unsigned fixed-aligned.apk && echo OK OK

NB: this builds a new ZIP file, preserving most ZIP metadata (and recompressing using the same compression level) but not everything: e.g. copying the existing local header extra fields which contain padding for alignment is not supported by Python's ZipFile, which is why zipalign is usually needed.

fix-newlines.py

Change line endings from LF to CRLF (or vice versa w/ --from-crlf).

Specify which files to change by providing at least one fnmatch-style pattern, e.g. 'META-INF/services/*'.

If the signed APK was built on Windows and has e.g. META-INF/services/ files with CRLF line endings whereas the unsigned APK was build on Linux/macOS and has LF line endings, this script may help.

$ fix-newlines.py --help usage: fix-newlines.py [-h] [--from-crlf] [--to-crlf] [--compresslevel PATTERN:LEVELS] [-v] INPUT_APK OUTPUT_APK PATTERN [PATTERN ...] [...] $ apksigcopier compare signed.apk --unsigned unsigned.apk DOES NOT VERIFY [...] $ fix-newlines.py unsigned.apk fixed.apk 'META-INF/services/*' fixing 'META-INF/services/foo'... fixing 'META-INF/services/bar'... $ zipalign -f 4 fixed.apk fixed-aligned.apk $ apksigcopier compare signed.apk --unsigned fixed-aligned.apk && echo OK OK

NB: this builds a new ZIP file, preserving most ZIP metadata (and recompressing using the same compression level) but not everything: e.g. copying the existing local header extra fields which contain padding for alignment is not supported by Python's ZipFile, which is why zipalign is usually needed.

fix-pg-map-id.py

Replace non-deterministic R8 pg-map-id in classes.dex (and classes2.dexetc. when present) and update checksums, also in baseline.prof.

$ fix-pg-map-id.py --help usage: fix-pg-map-id.py [-h] INPUT_DIR_OR_APK OUTPUT_DIR_OR_APK PG_MAP_ID [...] $ apksigcopier compare signed.apk --unsigned unsigned.apk DOES NOT VERIFY [...] $ diff -Naur <( unzip -p signed.apk classes.dex | xxd ) <( unzip -p unsigned.apk classes.dex | xxd ) [...] -00000000: 6465 780a 3033 3500 829f 202e c800 6ecc dex.035... ...n. -00000010: c15c 0a17 3737 73fc 982e 34db 6239 bc52 ...77s...4.b9.R +00000000: 6465 780a 3033 3500 d89f 1795 f719 4d94 dex.035.......M. +00000010: 8358 2526 2850 f9e5 ad3d e772 c82e 4f02 .X%&(P...=.r..O. [...] 005f1010: 2d61 7069 223a 3233 2c22 7067 2d6d 6170 -api":23,"pg-map -005f1020: 2d69 6422 3a22 3261 3939 3764 3322 2c22 -id":"2a997d3"," +005f1020: 2d69 6422 3a22 6565 3436 3531 3322 2c22 -id":"ee46513"," [...] $ fix-pg-map-id.py unsigned.apk fixed.apk 2a997d3 reading 'assets/dexopt/baseline.prof'... reading 'classes.dex'... reading 'classes2.dex'... fixing 'classes.dex'... dex version=035 fixing pg-map-id: b'ee46513' -> b'2a997d3' fixing signature: f7194d94835825262850f9e5ad3de772c82e4f02 -> c8006eccc15c0a17373773fc982e34db6239bc52 fixing checksum: 0x95179fd8 -> 0x2e209f82 fixing 'classes2.dex'... dex version=035 (not modified) fixing 'assets/dexopt/baseline.prof'... prof version=010 P fixing 'classes.dex' checksum: 0x95d40f72 -> 0x50191ba4 writing 'assets/dexopt/baseline.prof'... writing 'classes.dex'... writing 'classes2.dex'... $ zipalign -f 4 fixed.apk fixed-aligned.apk $ apksigcopier compare signed.apk --unsigned fixed-aligned.apk && echo OK OK

NB: this builds a new ZIP file, preserving most ZIP metadata (and recompressing using the same compression level) but not everything: e.g. copying the existing local header extra fields which contain padding for alignment is not supported by Python's ZipFile, which is why zipalign is usually needed.

rm-files.py

Remove entries from ZIP file.

Specify which files to remove by providing at least one fnmatch-style pattern, e.g. 'META-INF/MANIFEST.MF'.

$ rm-files.py --help usage: rm-files.py [-h] [-v] INPUT_APK OUTPUT_APK PATTERN [PATTERN ...] [...] $ rm-files.py some.apk fixed.apk META-INF/MANIFEST.IN skipping 'META-INF/MANIFEST.IN'... $ zipalign -f 4 fixed.apk fixed-aligned.apk

NB: this builds a new ZIP file, preserving most ZIP metadata (and recompressing using the same compression level) but not everything: e.g. copying the existing local header extra fields which contain padding for alignment is not supported by Python's ZipFile, which is why zipalign is usually needed.

sort-apk.py

Sort (and w/o --no-realign also realign) the ZIP entries of an APK.

If the ordering of the ZIP entries in an APK is not deterministic/reproducible, this script may help. You'll almost certainly need to use it for all builds though, since it can only sort the APK, not recreate a different ordering that is deterministic but not sorted; see also the alignment CAVEAT.

$ sort-apk.py --help usage: sort-apk.py [-h] [--no-realign] [--no-force-align] [--reset-lh-extra] INPUT_APK OUTPUT_APK [...] $ unzip -l unsigned.apk Archive: unsigned.apk Length Date Time Name


    6  2017-05-15 11:24   lib/armeabi/fake.so
 1672  2009-01-01 00:00   AndroidManifest.xml
  896  2009-01-01 00:00   resources.arsc
 1536  2009-01-01 00:00   classes.dex

 4110                     4 files

$ sort-apk.py unsigned.apk sorted.apk $ unzip -l sorted.apk Archive: sorted.apk Length Date Time Name


 1672  2009-01-01 00:00   AndroidManifest.xml
 1536  2009-01-01 00:00   classes.dex
    6  2017-05-15 11:24   lib/armeabi/fake.so
  896  2009-01-01 00:00   resources.arsc

 4110                     4 files

NB: this directly copies the (bytes of the) original ZIP entries from the original file, thus preserving all ZIP metadata.

CAVEAT: alignment

Unfortunately, the padding added to ZIP local header extra fields for alignment makes it hard to make sorting deterministic: unless the original APK was not aligned at all, the padding is often different when the APK entries had a different order (and thus a different offset) before sorting.

Because of this, sort-apk forcefully recreates the padding even if the entry is already aligned (since that doesn't mean the padding is identical) to make its output as deterministic as possible. The downside is that it'll often add "unnecessary" 8-byte padding to entries that didn't need alignment.

You can disable this using --no-force-align, or skip realignment completely using --no-realign. If you're certain you don't need to keep the old values, you can also choose to reset the local header extra fields to the values from the central directory entries with --reset-lh-extra.

If you use --reset-lh-extra, you'll probably want to combine it with either--no-force-align (which should prevent the "unnecessary" 8-byte padding) or--no-realign + zipalign (which uses smaller padding).

NB: the alignment padding used by sort-apk is the same as that used byapksigner (a 0xd935 "Android ZIP Alignment Extra Field" which stores the alignment itself plus zero padding and is thus always at least 6 bytes), whereaszipalign just uses plain zero padding.

sort-baseline.py

Sort baseline.profm (extracted or inside an APK).

$ sort-baseline.py --help usage: sort-baseline.py [-h] [--apk] INPUT_PROF_OR_APK OUTPUT_PROF_OR_APK [...] $ diff -qs a/baseline.profm b/baseline.profm Files a/baseline.profm and b/baseline.profm differ $ sort-baseline.py a/baseline.profm a/baseline-sorted.profm $ sort-baseline.py b/baseline.profm b/baseline-sorted.profm $ diff -qs a/baseline-sorted.profm b/baseline-sorted.profm Files a/baseline-sorted.profm and b/baseline-sorted.profm are identical

$ sort-baseline.py --apk unsigned.apk sorted-baseline.apk $ zipalign -f 4 sorted-baseline.apk sorted-baseline-aligned.apk

NB: does not support all file format versions yet.

NB: with --apk, this builds a new ZIP file, preserving most ZIP metadata (and recompressing using the same compression level) but not everything: e.g. copying the existing local header extra fields which contain padding for alignment is not supported by Python's ZipFile, which is why zipalign is usually needed.

zipalign.py

Align uncompressed ZIP/APK entries to 4-byte boundaries (and .so shared object files to 4096-byte boundaries with -p/--page-align, or other page sizes with-P/--page-size).

This implementation aims for compatibility with Android's zipalign, with the exception of there not being a -f option to enable overwriting an existing output file (it will always be overwritten), and the ALIGN parameter -- which must always be 4 anyway -- being optional; nor does it support the -c, -v, or -z options.

By default, the same plain zero padding as the original zipalign is used, but with the --pad-like-apksigner option it uses the same alignment padding asapksigner (a 0xd935 "Android ZIP Alignment Extra Field" which stores the alignment itself plus zero padding and is thus always at least 6 bytes).

$ zipalign.py --help usage: zipalign.py [-h] [-p] [-P N] [--pad-like-apksigner] [--replace] [--copy-extra] [--no-update-lfh] [ALIGN] INPUT_APK OUTPUT_APK [...] $ zipalign -f 4 fixed.apk fixed-aligned.apk $ zipalign.py fixed.apk fixed-aligned-py.apk $ cmp fixed-aligned.apk fixed-aligned-py.apk && echo OK OK

binres.py

Parse/dump android binary XML (AXML) or resources (ARSC).

NB: work in progress; output format may change.

dump

Parse & dump ARSC or AXML.

$ binres.py dump --help usage: binres.py dump [-h] [--apk APK] [--json] [--xml] [--deref] [--prolog] [-q] [-v] FILE_OR_PATTERN [FILE_OR_PATTERN ...] [...] $ binres.py dump AndroidManifest.xml file='AndroidManifest.xml' XML STRING POOL [#strings=16, #styles=0] XML RESOURCE MAP [#resources=6] XML NS START [lineno=1, prefix='android', uri='http://schemas.android.com/apk/res/android'] XML ELEM START [lineno=1, name='manifest', #attributes=7] ATTR: http://schemas.android.com/apk/res/android:versionCode=1 ATTR: http://schemas.android.com/apk/res/android:versionName='1' ATTR: http://schemas.android.com/apk/res/android:compileSdkVersion=29 ATTR: http://schemas.android.com/apk/res/android:compileSdkVersionCodename='10.0.0' ATTR: package='com.example' ATTR: platformBuildVersionCode=29 ATTR: platformBuildVersionName='10.0.0' XML ELEM START [lineno=2, name='uses-sdk', #attributes=2] ATTR: http://schemas.android.com/apk/res/android:minSdkVersion=21 ATTR: http://schemas.android.com/apk/res/android:targetSdkVersion=29 XML ELEM END [lineno=2, name='uses-sdk'] XML ELEM END [lineno=1, name='manifest'] XML NS END [lineno=1, prefix='android', uri='http://schemas.android.com/apk/res/android'] $ binres.py dump --xml AndroidManifest.xml

$ binres.py dump --apk some.apk '*.arsc' '*.xml' entry='AndroidManifest.xml' XML STRING POOL [#strings=26, #styles=0] [...] entry='resources.arsc' RESOURCE TABLE STRING POOL [flags=0x100, #strings=3, #styles=0] [...]

fastid

Quickly get appid & version code/name from APK(s).

$ binres.py fastid --help usage: binres.py fastid [-h] [--json] [--short] APK [APK ...] [...] $ binres.py fastid some.apk package=com.example versionCode=1 versionName=1 $ binres.py fastid --short some.apk com.example 1 1 $ binres.py fastid --json some.apk [ { "package": "com.example", "versionCode": 1, "versionName": "1" } ]

fastperms

Quickly get permissions from APK(s).

$ binres.py fastperms --help usage: binres.py fastperms [-h] [--json] [--with-id] [-q] APK [APK ...] [...] $ binres.py fastperms some.apk file='some.apk' permission=android.permission.CAMERA permission=android.permission.READ_EXTERNAL_STORAGE [maxSdkVersion=23] $ binres.py fastperms --with-id some.apk file='some.apk' package=com.example versionCode=1 versionName=1 permission=android.permission.CAMERA permission=android.permission.READ_EXTERNAL_STORAGE [maxSdkVersion=23] $ binres.py fastperms --json some.apk [ { "permissions": [ { "permission": "android.permission.CAMERA", "attributes": {} }, { "permission": "android.permission.READ_EXTERNAL_STORAGE", "attributes": { "maxSdkVersion": "23" } } ] } ] $ binres.py fastperms --json --with-id some.apk [ { "package": "com.example", "versionCode": 1, "versionName": "1", "permissions": [ { "permission": "android.permission.CAMERA", "attributes": {} }, { "permission": "android.permission.READ_EXTERNAL_STORAGE", "attributes": { "maxSdkVersion": "23" } } ] } ]

manifest-info

Dump basic manifest info from APK(s) as JSON.

$ binres.py manifest-info --help usage: binres.py manifest-info [-h] APK [APK ...] [...] $ binres.py manifest-info catima.apk { "catima.apk": { "appid": "me.hackerchick.catima", "version_code": 134, "version_name": "2.29.0", "min_sdk": 21, "target_sdk": 34, "features": [ { "name": "android.hardware.camera", "required": true }, [...] ], "permissions": [ { "name": "android.permission.CAMERA", "min_sdk_version": null, "max_sdk_version": null }, { "name": "android.permission.READ_EXTERNAL_STORAGE", "min_sdk_version": null, "max_sdk_version": 23 }, [...] ], "abis": [] } } $ binres.py manifest-info other.apk | jq '.[]|.abis' [ "arm64-v8a", "armeabi-v7a", "x86", "x86_64" ] $ binres.py manifest-info /dev/null { "/dev/null": { "error": "Expected end of central directory record (EOCD)" } }

NB: parse errors etc. will be returned as JSON and the exit status will be 1 if there are any.

diff-zip-meta.py

Diff ZIP file metadata.

NB: this will not compare the contents of the ZIP entries, only metadata and other non-contents bytes; to compare the contents of ZIP/APK files, use e.g.diffoscope.

This will show differences in filenames, central directory headers, local file headers, data descriptors, entry sizes, etc.

Additional tests include compression level (if it can be determined), CRC32 checksum of compressed data, and extra data before entries or the central directory; you can skip these (relatively slow) tests using --no-additional.

Some differences make the output quite verbose and/or are usually the result of other differences; you can skip/ignore these using --no-lfh-extra,--no-offsets, --no-ordering.

$ diff-zip-meta.py --help usage: diff-zip-meta.py [-h] [--no-additional] [--no-lfh-extra] [--no-offsets] [--no-ordering] ZIPFILE1 ZIPFILE2 [...] $ diff-zip-meta.py a.apk b.apk --- a.apk +++ b.apk entry foo:

NB: work in progress; output format may change.

dump-arsc.py

Dump resources.arsc (extracted or inside an APK) using aapt2.

$ dump-arsc.py --help usage: dump-arsc.py [-h] [--apk] ARSC_OR_APK [...] $ dump-arsc.py resources.arsc Binary APK Package name=com.example.app id=7f [...] $ dump-arsc.py --apk some.apk Binary APK Package name=com.example.app id=7f [...]

dump-axml.py

Dump Android binary XML (extracted or inside an APK) using aapt2.

$ dump-axml.py --help usage: dump-axml.py [-h] [--apk APK] AXML [...] $ dump-axml.py foo.xml N: android=http://schemas.android.com/apk/res/android (line=17) E: selector (line=17) E: item (line=18) [...] $ dump-axml.py --apk some.apk res/foo.xml N: android=http://schemas.android.com/apk/res/android (line=17) E: selector (line=17) E: item (line=18) [...]

dump-baseline.py

Dump baseline.prof/baseline.profm (extracted or inside an APK).

$ dump-baseline.py --help usage: dump-baseline.py [-h] [--apk] [-v] PROF_OR_APK [...] $ dump-baseline.py baseline.prof prof version=010 P num_dex_files=4 [...] $ dump-baseline.py baseline.profm profm version=002 num_dex_files=4 [...] $ dump-baseline.py some.apk entry=assets/dexopt/baseline.prof prof version=010 P num_dex_files=4 [...] entry=assets/dexopt/baseline.profm profm version=002 num_dex_files=4 [...]

NB: does not support all file format versions yet.

list-compresslevel.py

List ZIP entries with compression level.

You can optionally specify which files to list by providing one or more fnmatch-style patterns, e.g. 'assets/foo/*.bar'.

$ list-compresslevel.py --help usage: list-compresslevel.py [-h] [--levels LEVELS] APK [PATTERN ...] [...] $ list-compresslevel.py some.apk filename='AndroidManifest.xml' compresslevel=9|6 filename='classes.dex' compresslevel=None filename='resources.arsc' compresslevel=None [...] filename='META-INF/CERT.SF' compresslevel=9|6 filename='META-INF/CERT.RSA' compresslevel=9|6|4 filename='META-INF/MANIFEST.MF' compresslevel=9|6|4

NB: the compression level is not actually stored anywhere in the ZIP file, and is thus calculated by recompressing the data with different compression levels and checking the CRC32 of the result against the CRC32 of the original compressed data.

zipalignment.py

Show info about ZIP alignment.

$ zipalignment.py --help usage: zipalignment.py [-h] APK [APK ...] [...] $ zipalignment.py foo.apk bar.apk file='foo.apk' zipaligned (4-byte alignment) : yes files with apksigner padding : 0 apksigner alignments from extra fields : none most likely uncompressed .so page alignment : 4KiB file='bar.apk' zipaligned (4-byte alignment) : yes files with apksigner padding : 123 apksigner alignments from extra fields : 4 16384 most likely uncompressed .so page alignment : 16KiB

zipinfo.py

List ZIP entries (like zipinfo).

This implementation aims for compatibility with the default and -l output formats of Info-ZIP's zipinfo; the -e extended output format is unique to this implementation. Other formats and options are (currently) not supported.

Neither is the full variety of ZIP formats and extensions supported, just the most common ones (UNIX, FAT, NTFS).

The -l/--long option adds the compressed size before the compression type;-e/--extended does the same, adds the CRC32 checksum before the filename as well, uses a more standard date format, and treats filenames ending with a /as directories.

$ zipinfo.py --help usage: zipinfo.py [-h] [-1] [-e] [-l] [--sort-by-offset] ZIPFILE [...] $ zipinfo.py -e some.apk Archive: some.apk Zip file size: 5612 bytes, number of entries: 8 drw---- 2.0 fat 0 bX 2 defN 2017-05-15 11:25:18 00000000 META-INF/ -rw---- 2.0 fat 77 bl 76 defN 2017-05-15 11:25:18 b506b894 META-INF/MANIFEST.MF -rw---- 2.0 fat 1672 bl 630 defN 2009-01-01 00:00:00 615ef200 AndroidManifest.xml -rw---- 1.0 fat 1536 b- 1536 stor 2009-01-01 00:00:00 9987d5d8 classes.dex -rw---- 2.0 fat 29 bl 6 defN 2017-05-15 11:26:52 ff801cd1 temp.txt -rw---- 1.0 fat 6 b- 6 stor 2017-05-15 11:24:32 31963516 lib/armeabi/fake.so -rw---- 1.0 fat 896 b- 896 stor 2009-01-01 00:00:00 4fcab821 resources.arsc -rw---- 2.0 fat 20 bl 6 defN 2017-05-15 11:28:40 c9983e85 temp2.txt 8 files, 4236 bytes uncompressed, 3158 bytes compressed: 25.4% $ zipinfo.py -l some.apk Archive: some.apk Zip file size: 5612 bytes, number of entries: 8 -rw---- 2.0 fat 0 bX 2 defN 17-May-15 11:25 META-INF/ -rw---- 2.0 fat 77 bl 76 defN 17-May-15 11:25 META-INF/MANIFEST.MF -rw---- 2.0 fat 1672 bl 630 defN 09-Jan-01 00:00 AndroidManifest.xml -rw---- 1.0 fat 1536 b- 1536 stor 09-Jan-01 00:00 classes.dex -rw---- 2.0 fat 29 bl 6 defN 17-May-15 11:26 temp.txt -rw---- 1.0 fat 6 b- 6 stor 17-May-15 11:24 lib/armeabi/fake.so -rw---- 1.0 fat 896 b- 896 stor 09-Jan-01 00:00 resources.arsc -rw---- 2.0 fat 20 bl 6 defN 17-May-15 11:28 temp2.txt 8 files, 4236 bytes uncompressed, 3158 bytes compressed: 25.4%

The fields are: permissions, create version, create system, uncompressed size, extra info, compressed size (w/ --long or --extended), compression type, date, time, CRC32 (w/ --extended), filename.

The extra info field consists of two characters: the first is b for binary,t for text (uppercase for encrypted files); the second is X for data descriptor and extra field, l for just data descriptor, x for just extra field, - for neither.

See also:zipinfo(1),zipdetails(1).

helper scripts

inplace-fix.py

Convenience wrapper for some of the other scripts like fix-newlines that makes them modify the file in-place (and optionally zipalign it too).

$ inplace-fix.py --help usage: inplace-fix.py [-h] [--zipalign] [--page-align] [--page-size N] [--internal] COMMAND INPUT_FILE [...] [...] $ inplace-fix.py --zipalign fix-newlines unsigned.apk 'META-INF/services/' [RUN] python3 fix-newlines.py unsigned.apk /tmp/.../fixed.apk META-INF/services/ fixing 'META-INF/services/foo'... fixing 'META-INF/services/bar'... [RUN] zipalign 4 /tmp/.../fixed.apk /tmp/.../aligned.apk [MOVE] /tmp/.../aligned.apk to unsigned.apk

If zipalign is not found on $PATH but any of $ANDROID_HOME,$ANDROID_SDK, or $ANDROID_SDK_ROOT is set to an Android SDK directory, it will use zipalign from the latest build-tools subdirectory of the Android SDK. If no suitable zipalign command can be found this way or the--internal option is passed, zipalign.py will be used.

NB: build-tools 31.0.0 and 32.0.0 are skipped becausetheir zipalign is broken;--page-size requires build-tools >= 35.0.0-rc1.

NB: this script is not available as a repro-apk subcommand, but as a separaterepro-apk-inplace-fix command.

gradle integration

You can e.g. sort baseline.profm during the gradle build by adding something like this to your build.gradle:

Details

// NB: assumes reproducible-apk-tools is a submodule in the app repo's // root dir; adjust the path accordingly if it is found elsewhere project.afterEvaluate { tasks.compileReleaseArtProfile.doLast { outputs.files.each { file -> if (file.name.endsWith(".profm")) { exec { commandLine( "../reproducible-apk-tools/inplace-fix.py", "sort-baseline", file ) } } } } }

Alternatively, adding something like this allows you to modify the APK itself after building (and re-sign it if necessary):

Details

// NB: assumes reproducible-apk-tools is a submodule in the app repo's // root dir; adjust the path accordingly if it is found elsewhere android { applicationVariants.all { variant -> variant.outputs.each { output -> variant.packageApplicationProvider.get().doLast { exec { // set ANDROID_HOME for zipalign environment "ANDROID_HOME", android.sdkDirectory commandLine( "../reproducible-apk-tools/inplace-fix.py", "--zipalign", "fix-newlines", output.outputFile, "META-INF/services/*" ) } // re-sign w/ apksigner if needed if (variant.signingConfig != null) { def tools = "${android.sdkDirectory}/build-tools/${android.buildToolsVersion}" def sc = variant.signingConfig exec { environment "KS_PASS", sc.storePassword environment "KEY_PASS", sc.keyPassword commandLine( "${tools}/apksigner", "sign", "-v", "--ks", sc.storeFile, "--ks-pass", "env:KS_PASS", "--ks-key-alias", sc.keyAlias, "--key-pass", "env:KEY_PASS", output.outputFile ) } } } } } }

fnmatch-style patterns

Some of these scripts process/list files matching any of the provided patterns using Python's fnmatch.fnmatch(), Unix shell style:

*       matches everything
?       matches any single character
[seq]   matches any character in seq
[!seq]  matches any char not in seq

With one addition: an optional prefix ! negates the pattern, invalidating a successful match by any preceding pattern; use a backslash (\) in front of the first ! for patterns that begin with a literal !.

NB: to match e.g. everything except for *.xml, you need to provide two patterns: the first ('*') to match everything, the second ('!*.xml') to negate matching *.xml.

NB: * matches anything, including /, and the pattern matches the complete filename path, including leading directories, so e.g. foo/bar.baz is matched by both *.baz and foo/*.

CLI

NB: you can just use the scripts stand-alone; alternatively, you can install therepro-apk Python package and use them as subcommands of repro-apk:

$ repro-apk binres dump AndroidManifest.xml $ repro-apk binres dump --xml AndroidManifest.xml $ repro-apk binres dump --apk some.apk '.arsc' '.xml' $ repro-apk binres fastid some.apk $ repro-apk binres fastid --short some.apk $ repro-apk binres fastid --json some.apk $ repro-apk binres fastperms some.apk $ repro-apk binres fastperms --with-id some.apk $ repro-apk binres fastperms --json some.apk $ repro-apk binres fastperms --json --with-id some.apk $ repro-apk binres manifest-info catima.apk $ repro-apk diff-zip-meta a.apk b.apk $ repro-apk diff-zip-meta a.apk c.apk --no-offsets --no-ordering $ repro-apk dump-arsc resources.arsc $ repro-apk dump-arsc --apk some.apk $ repro-apk dump-axml foo.xml $ repro-apk dump-axml --apk some.apk res/foo.xml $ repro-apk dump-baseline baseline.prof $ repro-apk dump-baseline baseline.profm $ repro-apk dump-baseline --apk some.apk $ repro-apk fix-compresslevel unsigned.apk fixed.apk 6 assets/foo/bar.js $ repro-apk fix-files unsigned.apk fixed.apk unix2dos 'META-INF/services/' $ repro-apk fix-newlines unsigned.apk fixed.apk 'META-INF/services/' $ repro-apk fix-pg-map-id unsigned.apk fixed.apk da39a3e $ repro-apk fix-pg-map-id input-dir output-dir da39a3e $ repro-apk list-compresslevel some.apk $ repro-apk rm-files some.apk fixed.apk META-INF/MANIFEST.IN $ repro-apk sort-apk unsigned.apk sorted.apk $ repro-apk sort-baseline baseline.profm baseline-sorted.profm $ repro-apk sort-baseline --apk unsigned.apk sorted-baseline.apk $ repro-apk zipalign fixed.apk fixed-aligned-py.apk $ repro-apk zipalignment foo.apk bar.apk $ repro-apk zipinfo -e some.apk $ repro-apk zipinfo -l some.apk

Help

$ repro-apk --help $ repro-apk binres --help $ repro-apk binres dump --help $ repro-apk binres fastid --help $ repro-apk binres fastperms --help $ repro-apk binres manifest-info --help $ repro-apk diff-zip-meta --help $ repro-apk dump-arsc --help $ repro-apk dump-axml --help $ repro-apk dump-baseline --help $ repro-apk fix-compresslevel --help $ repro-apk fix-files --help $ repro-apk fix-newlines --help $ repro-apk fix-pg-map-id --help $ repro-apk list-compresslevel --help $ repro-apk rm-files --help $ repro-apk sort-apk --help $ repro-apk sort-baseline --help $ repro-apk zipalign --help $ repro-apk zipalignment --help $ repro-apk zipinfo --help

Installing

Using pip

NB: depending on your system you may need to use e.g. pip3 --userinstead of just pip.

From git

NB: this installs the latest development version, not the latest release.

$ git clone https://github.com/obfusk/reproducible-apk-tools.git $ cd reproducible-apk-tools $ pip install -e .

NB: you may need to add e.g. ~/.local/bin to your $PATH in order to run repro-apk.

To update to the latest development version:

$ cd reproducible-apk-tools $ git pull --rebase

Dependencies

Debian/Ubuntu

$ apt install python3-click $ apt install aapt # for dump-arsc.py & dump-axml.py $ apt install zipalign # for realignment; see examples

License

AGPLv3+

NB: v0.2.8 and earlier were licensed under GPLv3+.