Posts

Is the future of AI open or closed? Watch today’s Pr...

By Sayash Kapoor, Rishi Bommasani, Percy Liang, Arvind Narayanan Perhaps the ...

How to Migrate From MLflow to Neptune

MLflow is a framework widely used for its experiment-tracking capabilities, b...

Introducing Redesigned Navigation, Run Groups, Repor...

We’ve been working on these improvements for quite some time, so it’s excitin...

ML/AI Platform Build vs Buy Decision: What Factors t...

An ML/AI platform provides a coherent collection of tools and frameworks to b...

LLM Training: RLHF and Its Alternatives

I frequently reference a process called Reinforcement Learning with Human Fee...

From Self-Alignment to LongLoRA

Another month, another round of interesting research papers ranging from larg...

LLM Business and Busyness: Recent Company Investment...

Discussing Recent Company Investments and AI Adoption, New Small Openly Avail...

AI and Open Source in 2023

The Highs and Lows: A Year in Review

Practical Tips for Finetuning LLMs Using LoRA (Low-R...

Things I Learned From Hundreds of Experiments

A Potential Successor to RLHF for Efficient LLM Alig...

From Vision Transformers to innovative large language model finetuning techni...

Tackling Hallucinations, Boosting Reasoning Abilitie...

This month, I want to focus on three papers that address three distinct probl...

Ten Noteworthy AI Research Papers of 2023

This year has felt distinctly different. I've been working in, on, and with m...

Understanding and Coding Self-Attention, Multi-Head ...

This article will teach you about self-attention mechanisms used in transform...

Model Merging, Mixtures of Experts, and Towards Smal...

Model Merging, Mixtures of Experts, and Towards Smaller LLMs

Improving LoRA: Implementing Weight-Decomposed Low-R...

Low-rank adaptation (LoRA) is a machine learning technique that modifies a pr...

Tips for LLM Pretraining and Evaluating Reward Models

Discussing AI Research Papers in March 2024