Goodbye everyone!
As you might have noticed, the landscape of LLMs has changed dramatically in recent months. With the introduction of models boasting a whopping 100,000+ tokens of context memory and the big players like OpenAI, Google, and Anthropic integrating RAG directly into their official websites, I feel like this repo has served its purpose.
I've also been focusing on other projects and prioritizing my personal life and mental well-being. As a result, I haven't been able to dedicate time to updating this repo, which is evident from the lack of commits in the past few months.
Looking back, this repo was initially created to address the limitations of the 4096 context GPT-3, as that context size was simply too small. However, with the advancements in LLMs and the emergence of better GUIs, I believe that higher context is now preferable to using embeddings for RAG.
If you're looking for some awesome LLM GUIs, here are my top recommendations:
- SillyTavern - Perfect for
bot fookingroleplaying and even has RAG support! - LobeChat - Great for general-purpose LLM usage, although it doesn't have RAG.
- BetterChatGPT - Another fantastic option for general-purpose LLM usage, specifically designed for OpenAI. While it's not actively maintained, the UI closely resembles the original ChatGPT website.
(Plz don't unstar this repoš„ŗ)
Run gpt-3.5-turbo
or any other GPT models(text-davinci-003
) with this program!
Use gpt-4
or gpt-4-32k
to use the new GPT-4 models if you have access.
You can switch models in the config.json
file.
It's like https://chat.openai.com/ but in your CMD and better(in terms of memory).
You can add custom initial prompts and save/load your chat history!
Download and double-click the GPT3Bot.exe
or run.bat
to run the program!
In Linux and macOS, you can run ./GPT3Bot
to run the program.
Click to download: Stable Release | Development Build
Please check the Wiki for more information: Click Me
- Long term memory support! Keep hitting the 4096 tokens context limit? Worry no more with this CLI Bot. It has nearly INFINITE context memory(If you have infinite disk space lol), all thanks to Embeddings! If you want to see how this program handles embeddings internally, set
debug_reference
totrue
inconfig.json
! - Q&A with custom documents support! You can load custom documents, and perform Q&A with them, please check the Wiki for more info.
- You can use
/stop
to end and save the chat. - You can use
/undo
to undo your last prompt. - You can use
/reset
to reset your entire chat. - You can use
/dump
to dump your chat history to a .txt file inside thedump
folder. - You can place .txt files in the "initial" folder to set different initial prompts, and you can use the filename to load it when you open the program. Simply directly press enter after you open the program, then enter the initial prompt file's name and press enter to load it.
- After you execute
/stop
, the program will ask you to input the filename to save. You can press enter directly to skip this and not save the chat. If you input any other text and then press enter, the chat will be saved into a json in the "saved" folder. When you open the program next time, you can simply input "s"(which means saved), press enter, then type the saved chat's json file's name to load your saved chat. - Easy config file in
config.json
, can be easily modified. - Unlike other bots, this one actually streams. This means it will display the output as soon as a token is sent from the API(Just like what ChatGPT's website is doing), no need to wait until the entire response is generated!
- When the response is being streamed, you can press Ctrl+C to cancel the stream.
- Automatically use the system proxy. Note: This feature is only supported on Windows, because there's a bug in my proxy library that causes it fail to compile on Linux and macOS.
- Multiline input support, you need to press Ctrl+N or Alt+Enter to enter a new line.
- Ctrl+V pasting support, you can paste text from your clipboard by pressing Ctrl+V.
- Full UTF-8 support, you can type in any language you want!
- Full of colors(If your terminal supports it)!
- Fine tune helper, you can fine tune base models more easily(Only for professional users).
- Auto translator, you can translate text files automatically.
Written in C++ (Libraries used: Boost, cURL, nlohmann/json, libproxy, cpp-terminal, ftxui, oneTBB, clip, cpp-tiktoken, pcre2, utf8proc)
- Windows 10/11 64-bit
- Linux 64-bit (Tested on Ubuntu 20.04 & CentOS 8) (Won't work on Ubuntu 18.04, CentOS 7 and lower, because they don't support C++17)
- macOS 64-bit (Didn't test, but it should work on macOS 12 and higher)