Distributed malware processing framework based on Python, Redis and S3.
Karton is a robust framework for creating flexible and lightweight malware analysis backends. It can be used to connect malware* analysis systems into a robust pipeline with very little effort.
We've been in the automation business for a long time. We're dealing with more and more threats, and we have to automate everything to keep up with incidents. Because of this, we often end up with many scripts stuck together with duck duct tape and WD-40. These scripts are written by analysts in the heat of the moment, fragile and ugly - but they work, and produce intel that must be stored, processed further, sent to other systems or shared with other organisations.
We needed a way to take our PoC scripts and easily insert them into our analysis pipeline. We also wanted to monitor their execution, centralise logging, improve robustness, reduce development inertia... For this exact purpose, we created Karton.
* while Karton was designed with malware analysis in mind, it works nicely in every microservice-oriented project.
Installation is as easy as a single pip install
command:
pip3 install karton-core
In order to setup the whole backend environment you will also need S3-compatible storage and Redis, see the docs for details.
To use karton you have to provide class that inherits from Karton.
from karton.core import Karton, Task, Resource
class GenericUnpacker(Karton):
"""
Performs sample unpacking
"""
identity = "karton.generic-unpacker"
filters = [
{
"type": "sample",
"kind": "runnable",
"platform": "win32"
}
]
def process(self, task: Task) -> None:
# Get sample object
packed_sample = task.get_resource('sample')
# Log with self.log
self.log.info(f"Hi {packed_sample.name}, let me analyze you!")
...
# Send our results for further processing or reporting
task = Task(
{
"type": "sample",
"kind": "raw"
}, payload = {
"parent": packed_sample,
"sample": Resource(filename, unpacked)
})
self.send_task(task)
if __name__ == "__main__":
GenericUnpacker.main()
Some Karton systems are universal and useful to everyone. We decided to share them with the community.
This repository. It contains the karton.system
service - main service, responsible for dispatching tasks within the system. It also contains the karton.core
module, that is used as a library by other systems.
A small Flask dashboard for task and queue management and monitoring.
The "router". It recognises samples/files and produces various task types depending on the file format. Thanks to this, other systems may only listen for tasks with a specific format (for example, only zip
archives).
Generic archive unpacker. Archives uploaded into the system will be extracted, and every file will be processed individually.
Malware extractor. It uses Yara rules and Python modules to extract static configuration from malware samples and analyses. It's a fishing rod, not a fish - we don't share the modules themselves. But it's easy to write your own!
A very important part of the pipeline. Reporter submits all files, tags, comments and other intel produced during the analysis to MWDB. If you don't use MWDB yet or just prefer other backends, it's easy to write your own reporter.
Automatically runs Yara rules on all files in the pipeline, and tags samples appropriately. Rules not included ;).
Karton system that decodes files encoded with common methods, like hex
, base64
, etc. (You wouldn't believe how common it is).
A small wrapper around AutoIt-Ripper that extracts embedded AutoIt scripts and resources from compiled AutoIt executables.
Automated black-box malware analysis system with DRAKVUF engine under the hood, which does not require an agent on guest OS.
This is how these systems can be used to form a basic malware analysis pipeline:
A modular Karton Framework service that unpacks common packers like UPX and others using the Qiling Framework.
A Feature Rich Modular Malware Configuration Extraction Utility for MalDuck
Detect-It-Easy classifier for the Karton framework
RetDec unpacker module for the Karton framework
Malware similarity platform with modularity in mind.