DEV Community

# 75daysofllm

Posts

Day 30: Reformer: Efficient Transformer for Large Scale Models

Comments
3 min read
Day 29: Sparse Transformers: Efficient Scaling for Large Language Models

Comments
3 min read
Day 27: Regularization Techniques for Large Language Models (LLMs)

Comments
2 min read
Day 26: Learning Rate Schedules

Comments
2 min read
Day 22: Distributed Training in Large Language Models

Comments
3 min read
Ethical Considerations in LLM Development and Deployment

1
Comments
2 min read
Retrieval-Augmented Generation (RAG) in LLMs

Comments
3 min read
Day 28: Model Compression Techniques for Large Language Models (LLMs)

1
Comments
2 min read
Exploring ELECTRA - Efficient Pre-training for Transformers

1
Comments
4 min read
Day 25: Optimizer Algorithms for Large Language Models (LLMs)

9
Comments
3 min read
Day 14 of GPT: Deep Dive into Generative Pre-trained Transformers

Comments
3 min read
BERT: Revolutionizing Natural Language Processing

Comments
3 min read
Day 24: Gradient Accumulation

8
Comments
2 min read
Understanding Self-Attention and Multi-Head Attention in Deep Learning

Comments
4 min read
GPT-2 and GPT-3: The Evolution of Language Models

1
Comments
4 min read
T5 (Text-to-Text Transfer Transformer)

1
Comments
4 min read
Prompt Engineering

2
Comments 2
3 min read
Few-shot Learning: Teaching AI with Minimal Data

Comments 1
5 min read
Understanding the Attention Mechanism in Natural Language Processing

1
Comments
3 min read
Long Short-Term Memory (LSTM) Networks

Comments
4 min read
Understanding Recurrent Neural Networks (RNNs)

Comments
3 min read
Fine-tuning BERT: Unlocking the Power of Pre-trained Language Models

1
Comments 2
4 min read
Positional Encoding: Adding Sequence Awareness to Transformers

1
Comments
4 min read
Transformer Architecture: Revolutionizing NLP

2
Comments
3 min read
Backpropagation and Optimization in Neural Networks

Comments
3 min read
Understanding Neural Networks: A Detailed Exploration

Comments
5 min read
Detailed Introduction to Word Embedding

Comments
4 min read
Text Preprocessing for NLP

Comments
3 min read
Introduction to NLP

1
Comments
3 min read