Ep. 255 - June 5, 2024
Listen now
Description
ArXiv NLP research for Wednesday, June 05, 2024. 00:19: Improving In-Context Learning with Prediction Feedback for Sentiment Analysis 01:24: MultifacetEval: Multifaceted Evaluation to Probe LLMs in Mastering Medical Knowledge 03:01: Text Injection for Neural Contextual Biasing 04:16: 4D ASR: Joint Beam Search Integrating CTC, Attention, Transducer, and Mask Predict Decoders 06:03: Adversarial Moment-Matching Distillation of Large Language Models 07:05: Docs2KG: Unified Knowledge Graph Construction from Heterogeneous Documents Assisted by Large Language Models 08:48: Readability-guided Idiom-aware Sentence Simplification (RISS) for Chinese 09:56: Evaluation of data inconsistency for multi-modal sentiment analysis 10:55: BadAgent: Inserting and Activating Backdoor Attacks in LLM Agents 12:11: Unveiling Selection Biases: Exploring Order and Token Sensitivity in Large Language Models 13:16: From Tarzan to Tolkien: Controlling the Language Proficiency Level of LLMs for Content Generation 14:20: StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning 15:42: RadBARTsum: Domain Specific Adaption of Denoising Sequence-to-Sequence Models for Abstractive Radiology Report Summarization 17:00: Towards Detecting LLMs Hallucination via Markov Chain-based Multi-agent Debate Framework 18:14: Cryptocurrency Frauds for Dummies: How ChatGPT introduces us to fraud? 19:48: FragRel: Exploiting Fragment-level Relations in the External Memory of Large Language Models 20:59: Space Decomposition for Sentence Embedding 22:00: Towards Real-world Scenario: Imbalanced New Intent Discovery 23:40: Which Side Are You On? A Multi-task Dataset for End-to-End Argument Summarisation and Evaluation 25:20: CSS: Contrastive Semantic Similarity for Uncertainty Quantification of LLMs 27:03: StatBot.Swiss: Bilingual Open Data Exploration in Natural Language 28:10: Missci: Reconstructing Fallacies in Misrepresented Science 29:43: ChatLang-8: An LLM-Based Synthetic Data Generation Framework for Grammatical Error Correction 30:47: Linking Named Entities in Diderot's \textit{Encyclop\'edie} to Wikidata 32:06: Error-preserving Automatic Speech Recognition of Young English Learners' Language 33:37: Document-level Claim Extraction and Decontextualisation for Fact-Checking 34:45: The Challenges of Evaluating LLM Applications: An Analysis of Automated, Human, and LLM-Based Approaches 36:09: LLM-based Rewriting of Inappropriate Argumentation using Reinforcement Learning from Machine Feedback 37:39: IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models 39:46: Automating Turkish Educational Quiz Generation Using Large Language Models 41:34: Cycles of Thought: Measuring LLM Confidence through Stable Explanations 42:57: Are language models rational? The case of coherence norms and belief revision 43:58: What is the Best Way for ChatGPT to Translate Poetry? 45:20: Using Synchronic Definitions and Semantic Relations to Classify Semantic Change Types 46:14: MODABS: Multi-Objective Learning for Dynamic Aspect-Based Summarization 47:09: BIPED: Pedagogically Informed Tutoring System for ESL Education 48:24: Analyzing LLM Behavior in Dialogue Summarization: Unveiling Circumstantial Hallucination Trends 50:00: Wings: Learning Multimodal LLMs without Text-only Forgetting
More Episodes
ArXiv NLP research for Thursday, June 13, 2024. 00:20: Chain-of-Though (CoT) prompting strategies for medical error detection and correction 01:31: CoastTerm: a Corpus for Multidisciplinary Term Extraction in Coastal Scientific Literature 02:52: RH-SQL: Refined Schema and Hardness Prompt for...
Published 06/15/24
ArXiv NLP research for Thursday, June 13, 2024. 00:20: Deep Exploration of Cross-Lingual Zero-Shot Generalization in Instruction Tuning 01:53: Mixture-of-Skills: Learning to Optimize Data Usage for Fine-Tuning Large Language Models 03:26: Automated Essay Scoring Using Grammatical Variety and...
Published 06/15/24
Published 06/15/24