EM-LLM: Human-Inspired Episodic Memory for Infinite Context LLMs github.com 23 points by jbotz 15 hours ago
MacsHeadroom 7 hours ago
So, infinite context length by making it compute-bound instead of memory-bound. Curious how much longer this takes to run, and when it makes sense to use it instead of RAG.