ExperimentaList

GPT-4 Turbo

Chain-of-Thought Prompting Optimization on MMLU Benchmark

+12.3% Accuracy Chain-of-Thought

View Results ->

2h

Claude 3.5 Sonnet

RAG vs Fine-tuning: Document Q&A Performance Comparison

67% Cost Reduction RAG

View Results ->

5h

Llama 3.1 70B

Temperature Sweep Analysis for Code Generation Tasks

Optimal T=0.2 Temperature Tuning

View Results ->

1d

Gemini 1.5 Pro

Long Context Window Performance: 1M Token Stress Test

98.7% Needle Recall Context Window

View Results ->

2d

GPT-3.5 Turbo

Few-Shot Learning: 0-shot vs 5-shot Comparison

+34% Accuracy Few-Shot

View Results ->

3d

Mistral Large

System Prompt Engineering for JSON Output Reliability

99.2% Valid JSON Prompt Optimization

View Results ->

4d

Claude 3 Opus

Multi-Modal Vision Analysis: Image Understanding Benchmarks

89.4% Accuracy Multi-Modal

View Results ->

5d

GPT-4o

Latency Optimization: Streaming vs Batch Response Times

-42% Latency Performance

View Results ->

1w

Llama 3.2 3B

Quantization Study: INT8 vs FP16 Trade-offs

4x Faster Inference Optimization

View Results ->

2w

Medium

Senior Client Engineer (React & React Native)

$55K - $100K Remote

Apply Now ->

1d

Twitch

Contract React Native Engineer

Full Time Remote

Apply Now ->

2d

Figma

QA Automation Engineer

Full Time Remote

Apply Now ->

2d

Figma

Senior Marketing Program Manager

Full Time Remote

Apply Now ->

2d

Figma

Senior Product Designer

Full Time Remote

Apply Now ->

2d

Facebook

Remote Cyber Security Analyst US

$55K - $100K United States

Apply Now ->

2d