⌘Ctrlk

common

A Modular Approach for Clinical SLMs Driven by Synthetic Data with Pre-Instruction Tuning, Model Mer A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis Alleviating Distribution Shift in Synthetic Data for Machine Translation Quality Estimation DeCoT: Debiasing Chain-of-Thought for Knowledge-Intensive Tasks in Large Language Models via Causal Evaluating Language Models as Synthetic Data Generators From REAL to SYNTHETIC: Synthesizing Millions of Diversified and Complicated User Instructions with Generative Reward Modeling via Synthetic Criteria Preference Learning HintsOfTruth: A Multimodal Checkworthiness Detection Dataset with Real and Synthetic Claims OCEAN: Offline Chain-of-thought Evaluation and Alignment in Large Language Models On Synthetic Data Strategies for Domain-Specific Generative Retrieval Rethinking Chain-of-Thought from the Perspective of Self-Training Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation Self-Generated Critiques Boost Reward Modeling for Language Models SpaRE: Enhancing Spatial Reasoning in Vision-Language Models with Synthetic Data TARGA: Targeted Synthetic Data Generation for Practical Reasoning over Structured Data Theorem Prover as a Judge for Synthetic Data Generation Tree-of-Traversals: A Zero-Shot Reasoning Algorithm for Augmenting Black-box Language Models with Kn TreeCut: A Synthetic Unanswerable Math Word Problem Dataset for LLM Hallucination Evaluation Group Sequence Policy Optimization SEA: Low-Resource Safety Alignment for Multimodal Large Language Models via Synthetic Embeddings On LLMs-Driven Synthetic Data Generation, Curation, and Evaluation: A Survey

PreviousSynthetic Data NextA Modular Approach for Clinical SLMs Driven by Synthetic Data with Pre-Instruction Tuning, Model Mer

Last updated 5 months ago