/Mass_Summarization

Large Scale Dataset Cleaning (Summarization and Information Extraction) Using LLAMA2 70B

Primary LanguageJupyter Notebook

Mass Summarization

This repository is dedicated to the task of mass summarization using large language models (LLMs), particularly focusing on leveraging the capabilities of llama models. The primary goal is to facilitate large-scale dataset cleaning and summarization to enhance data understanding and usability.

Overview

The Mass_Summarization repository contains notebooks that demonstrate the use of llama2 models for summarizing large datasets. These notebooks are designed to be used with Google Colab and Runpod, making it easy for users to run and modify the code according to their needs. mass_summary

Notebooks

  • LLAMA2_13B_Summarizer.ipynb: Demonstrates how to perform mass summarization using the llama2 13B model on Google Colab.
  • LLAMA_2_70B_Information_Extractor.ipynb: Shows the use of the llama2 70B model for information extraction on Runpod.