Data Insights is a tool that leverages artificial intelligence to enhance data analysis and processing capabilities. Through natural language interaction, it empowers users to unlock valuable insights from their data effortlessly, without requiring deep programming expertise. Whether you're a beginner in data analysis or a seasoned professional, this tool offers an efficient and intuitive experience.
-
Talk to Excel
Simply ask questions in natural language, such as "Show me the top five products by sales," and instantly extract insights from Excel files. -
Interact with Databases
Connect to databases and retrieve results with straightforward commands, eliminating the need for complex SQL queries.
Data Insights is built upon powerful open-source technologies:
- Pandas AI: An extension of the Pandas library, enabling natural language queries for data analysis.
- Ollama: A framework for running large language models locally, powering the AI-driven interactions.
- DeepSeek Code: A coding assistant that enhances development efficiency and supports the project’s backend logic.
- Streamlit: A Python framework for creating interactive web applications, making it easy to build and share data-driven interfaces.
To get started with Data Insights, you'll need to set up a Python environment. We recommend using MiniConda for management. Follow these steps:
Download and install MiniConda, a lightweight Python environment manager.
Run the following command in your terminal to create a dedicated Python 3.11 environment:
conda create -n data_insights python=3.11
The project dependencies are listed in the requirements.txt
file. To install them, ensure your environment is activated, then run the following command in your terminal:
pip install -r requirements.txt -i https://pypi.tuna.tsinghua.edu.cn/simple
Follow these steps to set up Ollama on your local machine and get started with the DeepSeek Coder model:
-
Download Ollama:
Visit the official Ollama website and download the installer for your operating system (Windows, macOS, or Linux). -
Launch Ollama:
After installation, open your terminal or command prompt and start the Ollama service by running: -
Pull the DeepSeek Model:
Download the DeepSeek Coder model by executing the following command in your terminal:ollama pull deepseek-coder-v2 or ollama run deepseek-coder-v2
streamlit run data_insights_excel.py