[Paper Review] Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling
A detailed review of the paper 'Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling', posted on arXiv.
Tags: #Review #Reasoning #Small Language Models (SLMs) #Hybrid Architecture #Test-Time Scaling (TTS) #Supervised Fine-Tuning (SFT) #Reinforcement Learning (RL) #DeepConf #Computational Efficiency
January 5, 2026
[Paper Review] Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models
A detailed review of the paper 'Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models', posted on arXiv.
Tags: #Review #Small Language Models (SLMs) #Latency Optimization #Hybrid Architectures #Evolutionary Search #Weight Normalization #Efficient Attention #Depth-Width Ratios #Real-device Efficiency
November 30, 2025
[Paper Review] MoM: Mixtures of Scenario-Aware Document Memories for Retrieval-Augmented Generation Systems
A detailed review of the paper 'MoM: Mixtures of Scenario-Aware Document Memories for Retrieval-Augmented Generation Systems', posted on arXiv by Feiyu Xiong.
Tags: #Review #Retrieval-Augmented Generation (RAG) #Document Memory #Text Chunking #Small Language Models (SLMs) #Large Language Models (LLMs) #Scenario-Aware Processing #Multi-Layer Retrieval #Cognitive Simulation
October 17, 2025