Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains
Published: 2025-01-10, Tags:
Small Language Models (SLMs) Can Still Pack a Punch: A survey
Published: 2025-01-03, Tags:
Qwen2.5 Technical Report
Published: 2024-12-19, Tags:
Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks
Published: 2022-11-22, Tags:
AgentInstruct: Toward Generative Teaching with Agentic Flows
Published: 2024-07-03, Tags:
Magentic-One: A Generalist Multi-Agent System for Solving Complex Tasks
Published: 2024-11-07, Tags:
KTO: Model Alignment as Prospect Theoretic Optimization
Published: 2024-02-02, Tags:
A Survey on Federated Recommendation Systems
Published: 2022-12-27, Tags:
Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
Published: 2024-11-04, Tags:
ADOPT: Modified Adam Can Converge with Any $β_2$ with the Optimal Rate
Published: 2024-11-05, Tags:
Zipfian Whitening
Published: 2024-11-01, Tags:
Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes
Published: 2023-05-03, Tags:
LIMA: Less Is More for Alignment
Published: 2023-05-18, Tags:
On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability
Published: 2024-09-30, Tags:
Revealing the Barriers of Language Agents in Planning
Published: 2024-10-16, Tags:
Competition-Level Code Generation with AlphaCode
Published: 2022-02-08, Tags:
Flamingo: a Visual Language Model for Few-Shot Learning
Published: 2022-04-29, Tags:
Gemma 2: Improving Open Language Models at a Practical Size
Published: 2024-07-31, Tags:
Thinking LLMs: General Instruction Following with Thought Generation
Published: 2024-10-14, Tags:
Training-Free Long-Context Scaling of Large Language Models
Published: 2024-02-27, Tags:
Prover-Verifier Games improve legibility of LLM outputs
Published: 2024-07-18, Tags:
RAFT: Adapting Language Model to Domain Specific RAG
Published: 2024-03-15, Tags:
Qwen2 Technical Report
Published: 2024-07-15, Tags:
Mixture-of-Agents Enhances Large Language Model Capabilities
Published: 2024-06-07, Tags:
MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains
Enhancing Emotional Generation Capability of Large Language Models via Emotional Chain-of-Thought
Published: 2024-01-12, Tags:
SWAG: Storytelling With Action Guidance
Published: 2024-02-05, Tags:
The Prompt Report: A Systematic Survey of Prompting Techniques
Published: 2024-06-06, Tags:
When Can LLMs Actually Correct Their Own Mistakes? A Critical Survey of Self-Correction of LLMs
Published: 2024-06-03, Tags:
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning
Published: 2024-05-20, Tags:
xLSTM: Extended Long Short-Term Memory
Published: 2024-05-07, Tags:
An Early Categorization of Prompt Injection Attacks on Large Language Models
Published: 2024-01-31, Tags:
Automatic and Universal Prompt Injection Attacks against Large Language Models
Published: 2024-03-07, Tags:
Prompt Injection attack against LLM-integrated Applications
Published: 2023-06-08, Tags:
Tricking LLMs into Disobedience: Formalizing, Analyzing, and Detecting Jailbreaks
Published: 2023-05-24, Tags:
Better & Faster Large Language Models via Multi-token Prediction
Published: 2024-04-30, Tags:
The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions
Published: 2024-04-19, Tags:
Rho-1: Not All Tokens Are What You Need
Published: 2024-04-11, Tags:
Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
Published: 2024-04-10, Tags:
Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws
Published: 2024-04-08, Tags:
Evolutionary Optimization of Model Merging Recipes
Published: 2024-03-19, Tags:
Understanding Why Label Smoothing Degrades Selective Classification and How to Fix It
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling
Published: 2023-12-23, Tags:
Stealing Part of a Production Language Model
Published: 2024-03-11, Tags:
Scattered Mixture-of-Experts Implementation
Published: 2024-03-13, Tags:
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation
Published: 2024-03-08, Tags:
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Published: 2024-03-14, Tags:
Is Cosine-Similarity of Embeddings Really About Similarity?
Instruction Tuning for Large Language Models: A Survey
Published: 2023-08-21, Tags:
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Published: 2024-03-06, Tags:
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data
Published: 2024-02-12, Tags:
Graph Mamba: Towards Learning on Graphs with State Space Models
Published: 2024-02-13, Tags:
Mixtures of Experts Unlock Parameter Scaling for Deep RL
Scaling Laws for Fine-Grained Mixture of Experts
Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation
Published: 2021-08-27, Tags:
Leeroo Orchestrator: Elevating LLMs Performance Through Model Integration
Published: 2024-01-25, Tags:
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
Published: 2023-06-09, Tags:
Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Published: 2021-06-11, Tags:
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
Published: 2023-07-31, Tags:
Self-Rewarding Language Models
Published: 2024-01-18, Tags:
Fast Inference of Mixture-of-Experts Language Models with Offloading
Published: 2023-12-28, Tags:
Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy
Published: 2023-10-02, Tags:
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Published: 2024-01-11, Tags:
Textbooks Are All You Need
Published: 2023-06-20, Tags: LLM, Model
LLM Augmented LLMs: Expanding Capabilities through Composition
Published: 2024-01-04, Tags:
LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
Published: 2024-01-02, Tags:
A Review of Sparse Expert Models in Deep Learning
Published: 2022-09-04, Tags: survey
Mixtral of Experts
Published: 2024-01-08, Tags: Model
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only
Published: 2023-06-01, Tags: Model
LLM Factoscope: Uncovering LLMs' Factual Discernment through Inner States Analysis
Published: 2023-12-27, Tags:
Improving Text Embeddings with Large Language Models
Published: 2023-12-31, Tags:
Principled Instructions Are All You Need for Questioning LLaMA-1/2, GPT-3.5/4
Published: 2023-12-26, Tags:
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
Published: 2022-05-27, Tags:
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Published: 2023-05-29, Tags:
Retrieval-Augmented Generation for Large Language Models: A Survey
Published: 2023-12-18, Tags: LLM, RAG
ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent
Published: 2023-12-15, Tags:
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Published: 2023-12-11, Tags:
Vision-Language Models as a Source of Rewards
Published: 2023-12-14, Tags:
Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation
Published: 2023-12-05, Tags:
Prompting Frameworks for Large Language Models: A Survey
Published: 2023-11-21, Tags:
Accelerating Large Language Model Decoding with Speculative Sampling
Published: 2023-02-02, Tags:
SelfEval: Leveraging the discriminative nature of generative models for evaluation
Published: 2023-11-17, Tags:
FinMe: A Performance-Enhanced Large Language Model Trading Agent with Layered Memory and Character Design
Published: 2023-11-23, Tags:
Llama 2: Open Foundation and Fine-Tuned Chat Models
Published: 2023-07-18, Tags:
Simplifying Transformer Blocks
Published: 2023-11-03, Tags:
Zephyr: Direct Distillation of LM Alignment
Published: 2023-10-25, Tags:
System 2 Attention (is something you might need too)
Published: 2023-11-20, Tags:
Orca 2: Teaching Small Language Models How to Reason
Published: 2023-11-18, Tags:
Check Your Facts and Try Again: Improving Large Language Models with External Knowledge and Automated Feedback
Published: 2023-02-24, Tags:
When Not to Trust Language Models: Investigating Effectiveness of Parametric and Non-Parametric Memories
Published: 2022-12-20, Tags:
PDFTriage: Question Answering over Long, Structured Documents
Published: 2023-09-16, Tags:
Retrieve Anything To Augment Large Language Models
Published: 2023-10-11, Tags:
Active Retrieval Augmented Generation
Published: 2023-05-11, Tags:
Improving Retrieval-Augmented Large Language Models via Data Importance Learning
Published: 2023-07-06, Tags: LLM, RAG
Making Retrieval-Augmented Language Models Robust to Irrelevant Context
Merging Generated and Retrieved Knowledge for Open-Domain QA
Published: 2023-10-22, Tags:
Generate rather than Retrieve: Large Language Models are Strong Context Generators
Published: 2022-09-21, Tags:
Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model
Published: 2023-10-14, Tags:
A Study on the Efficiency and Generalization of Light Hybrid Retrievers
Published: 2022-10-04, Tags: