Hacker News
Building a Security Scanner for LLM Apps (promptfoo.dev)
7 points by danenania 12 hours ago | past | 3 comments
Will Agents Hack Everything? (promptfoo.dev)
2 points by mooreds 8 days ago | past | discuss
How to replicate the Claude Code attack with Promptfoo (promptfoo.dev)
6 points by typpo 26 days ago | past
How to replicate the Claude Code attack with Promptfoo (promptfoo.dev)
2 points by mooreds 28 days ago | past
Will agents hack everything? (promptfoo.dev)
6 points by danenania 32 days ago | past | 9 comments
Evaluating Political Bias in LLMs (promptfoo.dev)
2 points by hagbard_c 49 days ago | past
Promptfoo Raises $18.4M Series A to Build the Definitive AI Security Stack (promptfoo.dev)
1 point by swyx 89 days ago | past
Political-bias benchmark for Grok 4, GPT-4.1, Gemini 2.5 Pro and Claude Opus 4 (promptfoo.dev)
3 points by dangelosaurus 4 months ago | past
Next Generation of Red Teaming for LLM Agents (promptfoo.dev)
1 point by mooreds 5 months ago | past
Promptfoo: Secure your AI from prompt to production (promptfoo.dev)
2 points by handfuloflight 6 months ago | past
Questions censored by DeepSeek (promptfoo.dev)
384 points by typpo 10 months ago | past | 227 comments
Automated jailbreaking techniques with DALL-E (promptfoo.dev)
2 points by typpo on July 1, 2024 | past
Show HN: Automated red teaming for your LLM app (promptfoo.dev)
23 points by typpo on June 13, 2024 | past | 2 comments
Iterate on LLMs Faster (promptfoo.dev)
1 point by gmays on May 28, 2024 | past
Benchmark Command R vs. GPT/Claude on your own data (promptfoo.dev)
2 points by typpo on April 9, 2024 | past
DBRX vs. Mixtral vs. GPT: create your own benchmark (promptfoo.dev)
1 point by typpo on March 31, 2024 | past
How to benchmark Gemini vs. GPT with your own data (promptfoo.dev)
1 point by typpo on Dec 15, 2023 | past
Llama 2 uncensored vs. GPT3.5 benchmarking (promptfoo.dev)
3 points by mchiang on Aug 11, 2023 | past
How to benchmark Llama2 Uncensored vs. GPT-3.5 on your own inputs (promptfoo.dev)
16 points by typpo on Aug 10, 2023 | past
Benchmark Llama 2 vs. GPT on your own data (promptfoo.dev)
1 point by typpo on July 24, 2023 | past
