stanford-oval/genie-cloud

Add a layer of caching to NLP server methods

gcampax opened this issue · 0 comments

To reduce latency and compute costs, we should briefly cache all NLU and TTS requests in an in-memory caching system such as Redis or memcached.
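
A minimal sketch of what this could look like, assuming ioredis as the client and a hypothetical `parse()` callback standing in for the underlying NLU call (names and TTL are illustrative, not the actual genie-cloud API):

```typescript
import Redis from 'ioredis';
import * as crypto from 'crypto';

const redis = new Redis(); // connects to localhost:6379 by default

const NLU_CACHE_TTL = 60; // seconds; keep entries only briefly

// Build a deterministic cache key from the parameters that affect the result
function nluCacheKey(modelTag: string, utterance: string, contextCode: string): string {
    const hash = crypto.createHash('sha256')
        .update(JSON.stringify([modelTag, utterance, contextCode]))
        .digest('hex');
    return `nlu:${hash}`;
}

async function cachedParse(modelTag: string, utterance: string, contextCode: string,
                           parse: () => Promise<unknown>): Promise<unknown> {
    const key = nluCacheKey(modelTag, utterance, contextCode);

    const cached = await redis.get(key);
    if (cached !== null)
        return JSON.parse(cached); // cache hit: skip the NLU model entirely

    const result = await parse(); // cache miss: call the underlying NLU service
    // store with a short expiration so stale results age out quickly
    await redis.set(key, JSON.stringify(result), 'EX', NLU_CACHE_TTL);
    return result;
}
```

The same pattern would apply to TTS, keyed on the text and voice parameters; for TTS the cached value would be the audio payload (or a pointer to it) rather than a JSON parse result.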