Abstracts: October 9, 2023
Description
Members of the research community at Microsoft work continuously to advance their respective fields. Abstracts brings its audience to the cutting edge with them through short, compelling conversations about new and noteworthy achievements. In this episode, Dr. Sheng Zhang (https://www.microsoft.com/en-us/research/people/shezhan/), a Senior Researcher at Microsoft Research, joins host Dr. Gretchen Huizinga to discuss “UniversalNER: Targeted Distillation from Large Language Models for Open Named Entity Recognition (https://www.microsoft.com/en-us/research/publication/universalner-targeted-distillation-from-large-language-models-for-open-named-entity-recognition/).” In this paper, Zhang and his coauthors present mission-focused instruction tuning, a method for distilling large language models into smaller, more efficient ones for a broad application class. Their UniversalNER models achieved state-of-the-art performance in named entity recognition, an important natural language processing (NLP) task. Model distillation has the potential to make NLP and other capabilities more accessible, particularly in specialized domains such as biomedicine, which could benefit from more resource-efficient and transparent options.
Learn more:
* View the paper (https://www.microsoft.com/en-us/research/publication/universalner-targeted-distillation-from-large-language-models-for-open-named-entity-recognition/)
* UniversalNER project website with demo (https://universal-ner.github.io/)
* Code on GitHub (https://github.com/universal-ner/universal-ner)
* Dataset and models on Hugging Face (https://huggingface.co/Universal-NER)
More Episodes
Research manager Karin Strauss and members of the DNA Data Storage Project reflect on the path to developing a synthetic DNA–based system for archival data storage, including the recent open-source release of its most powerful algorithm for DNA error correction. Get the Trellis BMA code: GitHub -...
Published 11/19/24
The efficient simulation of molecules has the potential to change how the world understands biological systems and designs new drugs and biomaterials. Tong Wang discusses AI2BMD, an AI-based system designed to simulate large biomolecules with speed and accuracy. Read the paper. Get the code.
Published 11/14/24