Dcas89's picture

Dcas89 PRO

Dcas89

AI & ML interests

None yet

Recent Activity

reacted to breitburg's post with ๐Ÿ”ฅ about 5 hours ago
I've been experimenting with "pure" model alignment. The core idea is to only train a verifiable version of a capacity until the model generalizes it to the non-verifiable version. For example, training the model on factual self-knowledge, like the model's scale, architecture, runtime situation, and being able to predict its own behavior, betting this generalizes to real introspection about states that do not. The same principle applies to general instruction following -- no training on subjective judgement, only verifiable claims and inferences, betting the skill generalizes to instructions where correctness is a matter of judgment. The primary alignment claim is that an identity and taste that will emerge this way will be much more robust and honest than hand-scripted ones (e.g. "As an AI language model..."). During the training, we should never teach it to make any subjective claims or invent experiences that we assume it has, like "I don't have taste" or "I'm not self-aware in the way you think", as well as no narration of internal states like "I'm curious now". The main threat, of course, is that we'll simply inherit the training distribution of all the things like "taste", and we'll get an average. However, with the recent research about the models' introspection abilities, it might be as well the case that we'll get something that's more honest than something that tries to adhere to a specific spec file. I'm posting new experimental models trained that way in this collection: https://huggingface.co/collections/breitburg/neue
reacted to Hari5115's post with ๐Ÿ”ฅ about 5 hours ago
Something quietly caught my attention while going through this week's trending papers โ€” Three independent teams, all converging on the same problem โ€” and all trending on HuggingFace right now. That kind of convergence usually means the field is quietly agreeing something needs fixing. The problem, as I understand it: Most of us are still stuffing conversation history into a context window and calling it memory. That's not memory โ€” that's a very expensive clipboard. ๐Ÿ“„ Three papers worth reading: ๐Ÿ”น SkillOpt โ€” Microsoft Research ( @Yifan Yang, @Ziyang Gong et all ) Agent skills stored as natural language that improves with real usage, no retraining needed โ†’ https://huggingface.co/papers/2605.23904 ๐Ÿ”น Mem0 โ€” Mem0 (YC S24, 41K โญ) ( @Prateek Chhikara & all ) Graph-structured memory that retrieves what's relevant not just what's recent โ†’ https://arxiv.org/abs/2504.19413 ๐Ÿ”น EverMemOS โ€” EverMind (@Chuanrui Hu, @Xingze Gao et all ) Separates memory into episodic, semantic and procedural types โ€” closer to how human memory actually works โ†’ https://huggingface.co/papers/2601.02163 Great work from all three teams ๐Ÿ™Œ ๐Ÿ’ฌ What are you actually using for agent memory in production today? Still ConversationBufferMemory, a custom schema, something else entirely? And do you think bigger context windows eventually make this problem disappear? #AgentMemory #MemoryAugmentedAgents #LongContextLLM #AgenticAI #LangChain #SkillOpt #Mem0 #EverMemOS
View all activity

Organizations

None yet