Doctor Paper

Paper List

Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains

Published: 2025-01-10, Tags:

Small Language Models (SLMs) Can Still Pack a Punch: A survey

Published: 2025-01-03, Tags:

Qwen2.5 Technical Report

Published: 2024-12-19, Tags:

Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks

Published: 2022-11-22, Tags:

AgentInstruct: Toward Generative Teaching with Agentic Flows

Published: 2024-07-03, Tags:

Magentic-One: A Generalist Multi-Agent System for Solving Complex Tasks

Published: 2024-11-07, Tags:

KTO: Model Alignment as Prospect Theoretic Optimization

Published: 2024-02-02, Tags:

A Survey on Federated Recommendation Systems

Published: 2022-12-27, Tags:

Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent

Published: 2024-11-04, Tags:

ADOPT: Modified Adam Can Converge with Any $\beta_2$ with the Optimal Rate

Published: 2024-11-05, Tags:

Zipfian Whitening

Published: 2024-11-01, Tags:

Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes

Published: 2023-05-03, Tags:

LIMA: Less Is More for Alignment

Published: 2023-05-18, Tags:

On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability

Published: 2024-09-30, Tags:

Revealing the Barriers of Language Agents in Planning

Published: 2024-10-16, Tags:

Competition-Level Code Generation with AlphaCode

Published: 2022-02-08, Tags:

Flamingo: a Visual Language Model for Few-Shot Learning

Published: 2022-04-29, Tags:

Gemma 2: Improving Open Language Models at a Practical Size

Published: 2024-07-31, Tags:

Thinking LLMs: General Instruction Following with Thought Generation

Published: 2024-10-14, Tags:

Training-Free Long-Context Scaling of Large Language Models

Published: 2024-02-27, Tags:

Prover-Verifier Games improve legibility of LLM outputs

Published: 2024-07-18, Tags:

RAFT: Adapting Language Model to Domain Specific RAG

Published: 2024-03-15, Tags:

Qwen2 Technical Report

Published: 2024-07-15, Tags:

Mixture-of-Agents Enhances Large Language Model Capabilities

Published: 2024-06-07, Tags:

MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains

Published: 2024-07-18, Tags:

Enhancing Emotional Generation Capability of Large Language Models via Emotional Chain-of-Thought

Published: 2024-01-12, Tags:

SWAG: Storytelling With Action Guidance

Published: 2024-02-05, Tags:

The Prompt Report: A Systematic Survey of Prompting Techniques

Published: 2024-06-06, Tags:

When Can LLMs Actually Correct Their Own Mistakes? A Critical Survey of Self-Correction of LLMs

Published: 2024-06-03, Tags:

MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning

Published: 2024-05-20, Tags:

xLSTM: Extended Long Short-Term Memory

Published: 2024-05-07, Tags:

An Early Categorization of Prompt Injection Attacks on Large Language Models

Published: 2024-01-31, Tags:

Automatic and Universal Prompt Injection Attacks against Large Language Models

Published: 2024-03-07, Tags:

Prompt Injection attack against LLM-integrated Applications

Published: 2023-06-08, Tags:

Tricking LLMs into Disobedience: Formalizing, Analyzing, and Detecting Jailbreaks

Published: 2023-05-24, Tags:

Better & Faster Large Language Models via Multi-token Prediction

Published: 2024-04-30, Tags:

The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions

Published: 2024-04-19, Tags:

Rho-1: Not All Tokens Are What You Need

Published: 2024-04-11, Tags:

Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Published: 2024-04-10, Tags:

Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws

Published: 2024-04-08, Tags:

Evolutionary Optimization of Model Merging Recipes

Published: 2024-03-19, Tags:

Understanding Why Label Smoothing Degrades Selective Classification and How to Fix It

Published: 2024-03-19, Tags:

SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling

Published: 2023-12-23, Tags:

Stealing Part of a Production Language Model

Published: 2024-03-11, Tags:

Scattered Mixture-of-Experts Implementation

Published: 2024-03-13, Tags:

RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation

Published: 2024-03-08, Tags:

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Published: 2024-03-14, Tags:

Is Cosine-Similarity of Embeddings Really About Similarity?

Published: 2024-03-08, Tags:

Instruction Tuning for Large Language Models: A Survey

Published: 2023-08-21, Tags:

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Published: 2024-03-06, Tags:

BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data

Published: 2024-02-12, Tags:

Graph Mamba: Towards Learning on Graphs with State Space Models

Published: 2024-02-13, Tags:

Mixtures of Experts Unlock Parameter Scaling for Deep RL

Published: 2024-02-13, Tags:

Scaling Laws for Fine-Grained Mixture of Experts

Published: 2024-02-12, Tags:

Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation

Published: 2021-08-27, Tags:

Leeroo Orchestrator: Elevating LLMs Performance Through Model Integration

Published: 2024-01-25, Tags:

Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena

Published: 2023-06-09, Tags:

Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Published: 2021-06-11, Tags:

VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design

Published: 2023-07-31, Tags:

Self-Rewarding Language Models

Published: 2024-01-18, Tags:

Fast Inference of Mixture-of-Experts Language Models with Offloading

Published: 2023-12-28, Tags:

Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy

Published: 2023-10-02, Tags:

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Published: 2024-01-11, Tags:

Textbooks Are All You Need

Published: 2023-06-20, Tags: LLM, Model

LLM Augmented LLMs: Expanding Capabilities through Composition

Published: 2024-01-04, Tags:

LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

Published: 2024-01-02, Tags:

A Review of Sparse Expert Models in Deep Learning

Published: 2022-09-04, Tags: survey

Mixtral of Experts

Published: 2024-01-08, Tags: Model

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Published: 2024-01-02, Tags:

The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only

Published: 2023-06-01, Tags: Model

LLM Factoscope: Uncovering LLMs' Factual Discernment through Inner States Analysis

Published: 2023-12-27, Tags:

Improving Text Embeddings with Large Language Models

Published: 2023-12-31, Tags:

Principled Instructions Are All You Need for Questioning LLaMA-1/2, GPT-3.5/4

Published: 2023-12-26, Tags:

FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness

Published: 2022-05-27, Tags:

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Published: 2023-05-29, Tags:

Retrieval-Augmented Generation for Large Language Models: A Survey

Published: 2023-12-18, Tags: LLM, RAG

ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent

Published: 2023-12-15, Tags:

Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models

Published: 2023-12-11, Tags:

Vision-Language Models as a Source of Rewards

Published: 2023-12-14, Tags:

Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation

Published: 2023-12-05, Tags:

Prompting Frameworks for Large Language Models: A Survey

Published: 2023-11-21, Tags:

Accelerating Large Language Model Decoding with Speculative Sampling

Published: 2023-02-02, Tags:

SelfEval: Leveraging the discriminative nature of generative models for evaluation

Published: 2023-11-17, Tags:

FinMem: A Performance-Enhanced Large Language Model Trading Agent with Layered Memory and Character Design

Published: 2023-11-23, Tags:

Llama 2: Open Foundation and Fine-Tuned Chat Models

Published: 2023-07-18, Tags:

Simplifying Transformer Blocks

Published: 2023-11-03, Tags:

Zephyr: Direct Distillation of LM Alignment

Published: 2023-10-25, Tags:

System 2 Attention (is something you might need too)

Published: 2023-11-20, Tags:

Orca 2: Teaching Small Language Models How to Reason

Published: 2023-11-18, Tags:

Check Your Facts and Try Again: Improving Large Language Models with External Knowledge and Automated Feedback

Published: 2023-02-24, Tags:

When Not to Trust Language Models: Investigating Effectiveness of Parametric and Non-Parametric Memories

Published: 2022-12-20, Tags:

PDFTriage: Question Answering over Long, Structured Documents

Published: 2023-09-16, Tags:

Retrieve Anything To Augment Large Language Models

Published: 2023-10-11, Tags:

Active Retrieval Augmented Generation

Published: 2023-05-11, Tags:

Improving Retrieval-Augmented Large Language Models via Data Importance Learning

Published: 2023-07-06, Tags: LLM, RAG

Making Retrieval-Augmented Language Models Robust to Irrelevant Context

Published: 2023-10-02, Tags:

Merging Generated and Retrieved Knowledge for Open-Domain QA

Published: 2023-10-22, Tags:

Generate rather than Retrieve: Large Language Models are Strong Context Generators

Published: 2022-09-21, Tags:

Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model

Published: 2023-10-14, Tags:

A Study on the Efficiency and Generalization of Light Hybrid Retrievers

Published: 2022-10-04, Tags: