Welcome!

Unlock your personalized experience.

admin

Last seen: 23 hours ago

Member since Sep 06, 2024

[email protected]

AI leaderboards are no longer useful. It's time to switch to Pareto curves.

AI leaderboards are no longer useful. It's time to s...

admin

Sep 12, 2024

0

485

What spending $2,000 can tell us about evaluating AI agents

Scientists should use AI as a tool, not an oracle

Scientists should use AI as a tool, not an oracle

admin

Sep 12, 2024

0

113

How AI hype leads to flawed research that fuels more hype

AI scaling myths

AI scaling myths

admin

Sep 12, 2024

0

811

Scaling will run out. The question is when.

New paper: AI agents that matter

New paper: AI agents that matter

admin

Sep 12, 2024

0

2.1

Rethinking AI agent benchmarking and evaluation

AI existential risk probabilities are too unreliable to inform policy

AI existential risk probabilities are too unreliable...

admin

Sep 12, 2024

0

822

How speculation gets laundered through pseudo-quantification

ML/AI Platform Build vs Buy Decision: What Factors to Consider

ML/AI Platform Build vs Buy Decision: What Factors t...

admin

Sep 12, 2024

0

611

An ML/AI platform provides a coherent collection of tools and frameworks to b...

Introducing Redesigned Navigation, Run Groups, Reports, and More

Introducing Redesigned Navigation, Run Groups, Repor...

admin

Sep 12, 2024

0

922

We’ve been working on these improvements for quite some time, so it’s excitin...

How to Migrate From MLflow to Neptune

How to Migrate From MLflow to Neptune

admin

Sep 12, 2024

0

2.1

MLflow is a framework widely used for its experiment-tracking capabilities, b...

Building LLM Applications With Vector Databases

Building LLM Applications With Vector Databases

admin

Sep 12, 2024

0

1.6

As a Machine Learning Engineer working with many companies, I repeatedly enco...

Adversarial Machine Learning: Defense Strategies

Adversarial Machine Learning: Defense Strategies

admin

Sep 12, 2024

0

350

The growing prevalence of ML models in business-critical applications results...

3 Takes on End-to-End For the MLOps Stack: Was It Worth It?

3 Takes on End-to-End For the MLOps Stack: Was It Wo...

admin

Sep 12, 2024

0

822

As machine learning (ML) drives innovation across industries, organizations s...

LLM Observability: Fundamentals, Practices, and Tools

LLM Observability: Fundamentals, Practices, and Tools

admin

Sep 12, 2024

0

457

Large Language Models (LLMs) have become the driving force behind AI-powered ...

Observability in LLMOps: Different Levels of Scale

Observability in LLMOps: Different Levels of Scale

admin

Sep 12, 2024

0

475

Observability is invaluable in LLMOps. Whether we’re talking about pretrainin...

LLM Evaluation For Text Summarization

LLM Evaluation For Text Summarization

admin

Sep 12, 2024

0

1.4

Text summarization is a prime use case of LLMs (Large Language Models). It ai...

Strategies For Effective Prompt Engineering

Strategies For Effective Prompt Engineering

admin

Sep 12, 2024

0

620

When I first delved into machine learning, prompt engineering seemed like a n...

LLM For Structured Data

LLM For Structured Data

admin

Sep 12, 2024

0

2.1

It is estimated that 80% to 90% of the data worldwide is unstructured. Howeve...