common

A Modular Approach for Clinical SLMs Driven by Synthetic Data with Pre-Instruction Tuning, Model Merc
A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis
Alleviating Distribution Shift in Synthetic Data for Machine Translation Quality Estimation
DeCoT: Debiasing Chain-of-Thought for Knowledge-Intensive Tasks in Large Language Models via Causal
Evaluating Language Models as Synthetic Data Generators
From REAL to SYNTHETIC: Synthesizing Millions of Diversified and Complicated User Instructions with
Generative Reward Modeling via Synthetic Criteria Preference Learning
HintsOfTruth: A Multimodal Checkworthiness Detection Dataset with Real and Synthetic Claims
OCEAN: Offline Chain-of-thought Evaluation and Alignment in Large Language Models
On Synthetic Data Strategies for Domain-Specific Generative Retrieval
Rethinking Chain-of-Thought from the Perspective of Self-Training
Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation
Self-Generated Critiques Boost Reward Modeling for Language Models
SpaRE: Enhancing Spatial Reasoning in Vision-Language Models with Synthetic Data
TARGA: Targeted Synthetic Data Generation for Practical Reasoning over Structured Data
Theorem Prover as a Judge for Synthetic Data Generation
Tree-of-Traversals: A Zero-Shot Reasoning Algorithm for Augmenting Black-box Language Models with Kn
TreeCut: A Synthetic Unanswerable Math Word Problem Dataset for LLM Hallucination Evaluation
Group Sequence Policy Optimization
SEA: Low-Resource Safety Alignment for Multimodal Large Language Models via Synthetic Embeddings
On LLMs-Driven Synthetic Data Generation, Curation, and Evaluation: A Survey
