Beyond the Warm Embrace: A Deeper Look at Hugging Face | by Zachary Raicik

Fine tuning language models for Named Entity Recognition

Hugging Face is a platform that offers tools and pre-trained models for various Natural Language Processing (NLP) and Natural Language Understanding (NLU) tasks. In our previous article, A Warm Embrace: Exploring Hugging Face, we dove into the basics of this platform and its open-source library that features implementations of many state-of-the-art transformer architectures. This post enhances the Hugging Face documentation by providing emerging data scientists with a single, connected view of various Hugging Face tools for a specific task. Specifically, this article explains how to piece together multiple Hugging Face capabilities to fine-tune an existing language model for named entity recognition (“NER”).

In this section, we briefly look at two foundational concepts essential for building our model. As a reminder, we covered Hugging Face basics in A Warm Embrace: Exploring Hugging Face.

Named Entity Recognition
Model Fine-tuning

In the sections below, it’s assumed you have some knowledge of model development and the associated concepts — however, if anything is unclear feel free to reach out!

Named Entity Recognition

Named Entity Recognition (“NER”) is a common natural language processing task of identifying and categorizing relevant information, or entities, into one of many predefined (named) groups. NER models can be trained on a variety of entities. Some of the most common ones are:

Names
Organizations
Dates
Places & Locations

In the image below, I manually tagged a couple of different named entities in a sample sentence. In the context of machine learning and NLP, NER is the process of automating this categorization process through models.

NER models can enable a variety of tasks including but not limited to, information retrieval, content summarization, content recommendation and machine translation.

Model Fine-Tuning

Source link

What's Hot

Microsoft Released LLM2CLIP: A New AI Technique in which a LLM Acts as a Teacher for CLIP’s Visual Encoder

This Machine Learning Paper Transforms Embodied AI Efficiency: New Scaling Laws for Optimizing Model and Dataset Proportions in Behavior Cloning and World Modeling Tasks

Gradient Boosting | Towards Data Science

Beyond the Warm Embrace: A Deeper Look at Hugging Face | by Zachary Raicik | Nov, 2023

Gradient Boosting | Towards Data Science

A Practical Framework for Data Analysis: 6 Essential Principles | by Pararawendy Indarjo | Nov, 2024

How I Created a Data Science Project Following CRISP-DM Lifecycle | by Gustavo Santos | Nov, 2024

Leave A Reply Cancel Reply

How ML AI Can Help Businesses Reduce Overhead Costs

How the AI Surge May Help Current WFH Employees

The ultimate contact center automation guide

Top 5AI Development Companies To Transform Your Business | by Amyra Sheldon

Microsoft Released LLM2CLIP: A New AI Technique in which a LLM Acts as a Teacher for CLIP’s Visual Encoder

This Machine Learning Paper Transforms Embodied AI Efficiency: New Scaling Laws for Optimizing Model and Dataset Proportions in Behavior Cloning and World Modeling Tasks

Gradient Boosting | Towards Data Science

No Train, All Gain: Enhancing Deep Frozen Representations with Self-Supervised Gradients

Our Picks

Microsoft Released LLM2CLIP: A New AI Technique in which a LLM Acts as a Teacher for CLIP’s Visual Encoder

This Machine Learning Paper Transforms Embodied AI Efficiency: New Scaling Laws for Optimizing Model and Dataset Proportions in Behavior Cloning and World Modeling Tasks

Gradient Boosting | Towards Data Science

What's Hot

Beyond the Warm Embrace: A Deeper Look at Hugging Face | by Zachary Raicik | Nov, 2023

Fine tuning language models for Named Entity Recognition

Named Entity Recognition

Model Fine-Tuning

Related Posts

Leave A Reply Cancel Reply