Welcome!

Unlock your personalized experience.

Posts

Is the future of AI open or closed? Watch today’s Princeton-Stanford workshop

Is the future of AI open or closed? Watch today’s Pr...

admin

Sep 12, 2024

0

1.6

By Sayash Kapoor, Rishi Bommasani, Percy Liang, Arvind Narayanan Perhaps the ...

How to Migrate From MLflow to Neptune

How to Migrate From MLflow to Neptune

admin

Sep 12, 2024

0

2.1

MLflow is a framework widely used for its experiment-tracking capabilities, b...

Introducing Redesigned Navigation, Run Groups, Reports, and More

Introducing Redesigned Navigation, Run Groups, Repor...

admin

Sep 12, 2024

0

922

We’ve been working on these improvements for quite some time, so it’s excitin...

ML/AI Platform Build vs Buy Decision: What Factors to Consider

ML/AI Platform Build vs Buy Decision: What Factors t...

admin

Sep 12, 2024

0

611

An ML/AI platform provides a coherent collection of tools and frameworks to b...

LLM Training: RLHF and Its Alternatives

LLM Training: RLHF and Its Alternatives

admin

Sep 12, 2024

0

2.1

I frequently reference a process called Reinforcement Learning with Human Fee...

From Self-Alignment to LongLoRA

From Self-Alignment to LongLoRA

admin

Sep 12, 2024

0

212

Another month, another round of interesting research papers ranging from larg...

LLM Business and Busyness: Recent Company Investments and AI Adoption, New Small Openly Available LLMs, and LoRA Research

LLM Business and Busyness: Recent Company Investment...

admin

Sep 12, 2024

0

1.2

Discussing Recent Company Investments and AI Adoption, New Small Openly Avail...

AI and Open Source in 2023

AI and Open Source in 2023

admin

Sep 12, 2024

0

375

The Highs and Lows: A Year in Review

Practical Tips for Finetuning LLMs Using LoRA (Low-Rank Adaptation)

Practical Tips for Finetuning LLMs Using LoRA (Low-R...

admin

Sep 12, 2024

0

162

Things I Learned From Hundreds of Experiments

A Potential Successor to RLHF for Efficient LLM Alignment and the Resurgence of CNNs

A Potential Successor to RLHF for Efficient LLM Alig...

admin

Sep 12, 2024

0

177

From Vision Transformers to innovative large language model finetuning techni...

Tackling Hallucinations, Boosting Reasoning Abilities, and New Insights into the Transformer Architecture

Tackling Hallucinations, Boosting Reasoning Abilitie...

admin

Sep 12, 2024

0

1.8

This month, I want to focus on three papers that address three distinct probl...

Ten Noteworthy AI Research Papers of 2023

Ten Noteworthy AI Research Papers of 2023

admin

Sep 12, 2024

0

1.3

This year has felt distinctly different. I've been working in, on, and with m...

Understanding and Coding Self-Attention, Multi-Head Attention, Cross-Attention, and Causal-Attention in LLMs

Understanding and Coding Self-Attention, Multi-Head ...

admin

Sep 12, 2024

0

1.6

This article will teach you about self-attention mechanisms used in transform...

Model Merging, Mixtures of Experts, and Towards Smaller LLMs

Model Merging, Mixtures of Experts, and Towards Smal...

admin

Sep 12, 2024

0

2.1

Model Merging, Mixtures of Experts, and Towards Smaller LLMs

Improving LoRA: Implementing Weight-Decomposed Low-Rank Adaptation (DoRA) from Scratch

Improving LoRA: Implementing Weight-Decomposed Low-R...

admin

Sep 12, 2024

0

1.4

Low-rank adaptation (LoRA) is a machine learning technique that modifies a pr...

Tips for LLM Pretraining and Evaluating Reward Models

Tips for LLM Pretraining and Evaluating Reward Models

admin

Sep 12, 2024

0

753

Discussing AI Research Papers in March 2024