Transforming CTI into Mitre ATT&CK Flows

This GitHub repository houses a cutting-edge project that leverages Retrieval Augmented Generation (RAG) to identify dependencies between Red Team attack techniques. The system utilizes MistralAI, in conjunction with LangChain, to perform advanced analysis and reasoning for the identification of unsecured assets in the realm of cyber threat intelligence.

Overview

This initiative introduces a comprehensive framework to convert Cyber Threat Intelligence (CTI) data into flows compatible with the Mitre ATT&CK framework. The conversion process spans multiple steps, and the ensuing guide facilitates seamless setup and execution within this framework.

Prerequisites

Access to a Google Colab account for running the associated notebooks.

Getting Approval for Llama 2

Before initiating the notebook, ensure you have a Pinecone account and the necessary approval for using the Llama 2 model.

Step 1: Fill in the Llama 2 Access Request Form

Complete the Llama access request form, specifying the need for both the Llama 2 and Llama Chat models. Use the email associated with your HuggingFace account.
Typically, approval emails are received within an hour.

Step 2: Request Access to the Llama 2 Model

Visit the Llama 2 13B Chat model page.
Submit the request form for downloading the model.
Approval is generally received within an hour.

Configuring RAG with Llama 2

After securing approval, follow these steps to set up the notebook for Retrieval-Augmented Generation (RAG) with Llama 2. Replace three key strings as indicated throughout the notebook:

PINECONE_API_KEY: Obtain from your Pinecone account.
PINECONE_ENV: Extract from Pinecone under the Environment header.
HF_AUTH_TOKEN: Generate or use an existing token from the Access token page.

Pinecone API Key

Create a Pinecone account if you don't have one.
Sign in and navigate to API Keys on the right panel.
Copy the PINECONE_API_KEY using the designated button.

Pinecone Environment

Copy the Pinecone environment (PINECONE_ENV) under the Environment header.

Hugging Face Authorization Token

Generate a new token or use an existing one from the Access token page.

Executing the Conversion Process

Step 1: Summarize and Run TRAM

Open the notebook Tram2flow_fin.ipynb.
Execute the code within the notebook.
Follow the prompts and instructions to summarize and run TRAM.
Save the output for further analysis.

Step 2: Run LLM Analysis

Open the notebook operator.ipynb.
Execute the code within the notebook.
Follow the prompts and instructions to perform LLM analysis.
Save the generated results for subsequent steps.

Step 3: Run Convert DataFrame to STIX

Open the notebook LLM_output_to_Image.ipynb.
Execute the code within the notebook.
Follow the prompts and instructions to convert DataFrame to STIX format.
Save the generated STIX file.

Step 4: Run Convert STIX to PNG

Open the notebook Json_to_PNG.ipynb.
Execute the code within the notebook.
Follow the prompts and instructions to convert STIX to PNG images.
Save the generated PNG files representing Mitre ATT&CK Flows.

Conclusion

Follow these outlined steps to successfully convert CTI data into Mitre ATT&CK Flow representations. Ensure you save the outputs at each step for future reference and analysis.

Issues and Support

For any concerns or inquiries, kindly open an issue in this repository.

License

This project operates under the MIT License.

khanhgn/CTI2FLOWS