Welcome!

Unlock your personalized experience.

admin

Last seen: 23 hours ago

Member since Sep 06, 2024

[email protected]

Reinforcement Learning From Human Feedback (RLHF) For LLMs

Reinforcement Learning From Human Feedback (RLHF) Fo...

admin

Sep 12, 2024

0

933

Reinforcement Learning from Human Feedback (RLHF) has turned out to be the ke...

Is the future of AI open or closed? Watch today’s Princeton-Stanford workshop

Is the future of AI open or closed? Watch today’s Pr...

admin

Sep 12, 2024

0

1.6

By Sayash Kapoor, Rishi Bommasani, Percy Liang, Arvind Narayanan Perhaps the ...

Evaluating LLMs is a minefield

Evaluating LLMs is a minefield

admin

Sep 12, 2024

0

1.6

Annotated slides from a recent talk

How Transparent Are Foundation Model Developers?

How Transparent Are Foundation Model Developers?

admin

Sep 12, 2024

0

1.3

Introducing the Foundation Model Transparency Index

LLM Training: RLHF and Its Alternatives

LLM Training: RLHF and Its Alternatives

admin

Sep 12, 2024

0

2.1

I frequently reference a process called Reinforcement Learning with Human Fee...

From Self-Alignment to LongLoRA

From Self-Alignment to LongLoRA

admin

Sep 12, 2024

0

212

Another month, another round of interesting research papers ranging from larg...

LLM Business and Busyness: Recent Company Investments and AI Adoption, New Small Openly Available LLMs, and LoRA Research

LLM Business and Busyness: Recent Company Investment...

admin

Sep 12, 2024

0

1.2

Discussing Recent Company Investments and AI Adoption, New Small Openly Avail...

AI and Open Source in 2023

AI and Open Source in 2023

admin

Sep 12, 2024

0

375

The Highs and Lows: A Year in Review

A Potential Successor to RLHF for Efficient LLM Alignment and the Resurgence of CNNs

A Potential Successor to RLHF for Efficient LLM Alig...

admin

Sep 12, 2024

0

177

From Vision Transformers to innovative large language model finetuning techni...

Practical Tips for Finetuning LLMs Using LoRA (Low-Rank Adaptation)

Practical Tips for Finetuning LLMs Using LoRA (Low-R...

admin

Sep 12, 2024

0

162

Things I Learned From Hundreds of Experiments

Tackling Hallucinations, Boosting Reasoning Abilities, and New Insights into the Transformer Architecture

Tackling Hallucinations, Boosting Reasoning Abilitie...

admin

Sep 12, 2024

0

1.8

This month, I want to focus on three papers that address three distinct probl...

Ten Noteworthy AI Research Papers of 2023

Ten Noteworthy AI Research Papers of 2023

admin

Sep 12, 2024

0

1.3

This year has felt distinctly different. I've been working in, on, and with m...

Understanding and Coding Self-Attention, Multi-Head Attention, Cross-Attention, and Causal-Attention in LLMs

Understanding and Coding Self-Attention, Multi-Head ...

admin

Sep 12, 2024

0

1.6

This article will teach you about self-attention mechanisms used in transform...

Model Merging, Mixtures of Experts, and Towards Smaller LLMs

Model Merging, Mixtures of Experts, and Towards Smal...

admin

Sep 12, 2024

0

2.1

Model Merging, Mixtures of Experts, and Towards Smaller LLMs

Improving LoRA: Implementing Weight-Decomposed Low-Rank Adaptation (DoRA) from Scratch

Improving LoRA: Implementing Weight-Decomposed Low-R...

admin

Sep 12, 2024

0

1.4

Low-rank adaptation (LoRA) is a machine learning technique that modifies a pr...

Research Papers in February 2024: A LoRA Successor, Small Finetuned LLMs Vs Generalist LLMs, and Transparent LLM Research

Research Papers in February 2024: A LoRA Successor, ...

admin

Sep 12, 2024

0

1.7

Once again, this has been an exciting month in AI research. This month, I'm c...