| | Building a Security Scanner for LLM Apps (promptfoo.dev) |
| 7 points by danenania 12 hours ago | past | 3 comments |
|
| | Will Agents Hack Everything? (promptfoo.dev) |
| 2 points by mooreds 8 days ago | past | discuss |
|
| | How to replicate the Claude Code attack with Promptfoo (promptfoo.dev) |
| 6 points by typpo 26 days ago | past |
|
| | How to replicate the Claude Code attack with Promptfoo (promptfoo.dev) |
| 2 points by mooreds 28 days ago | past |
|
| | Will agents hack everything? (promptfoo.dev) |
| 6 points by danenania 32 days ago | past | 9 comments |
|
| | Evaluating Political Bias in LLMs (promptfoo.dev) |
| 2 points by hagbard_c 49 days ago | past |
|
| | Promptfoo Raises $18.4M Series A to Build the Definitive AI Security Stack (promptfoo.dev) |
| 1 point by swyx 89 days ago | past |
|
| | Political-bias benchmark for Grok 4, GPT-4.1, Gemini 2.5 Pro and Claude Opus 4 (promptfoo.dev) |
| 3 points by dangelosaurus 4 months ago | past |
|
| | Next Generation of Red Teaming for LLM Agents (promptfoo.dev) |
| 1 point by mooreds 5 months ago | past |
|
| | Promptfoo: Secure your AI from prompt to production (promptfoo.dev) |
| 2 points by handfuloflight 6 months ago | past |
|
| | Questions censored by DeepSeek (promptfoo.dev) |
| 384 points by typpo 10 months ago | past | 227 comments |
|
| | Automated jailbreaking techniques with DALL-E (promptfoo.dev) |
| 2 points by typpo on July 1, 2024 | past |
|
| | Show HN: Automated red teaming for your LLM app (promptfoo.dev) |
| 23 points by typpo on June 13, 2024 | past | 2 comments |
|
| | Iterate on LLMs Faster (promptfoo.dev) |
| 1 point by gmays on May 28, 2024 | past |
|
| | Benchmark Command R vs. GPT/Claude on your own data (promptfoo.dev) |
| 2 points by typpo on April 9, 2024 | past |
|
| | DBRX vs. Mixtral vs. GPT: create your own benchmark (promptfoo.dev) |
| 1 point by typpo on March 31, 2024 | past |
|
| | How to benchmark Gemini vs. GPT with your own data (promptfoo.dev) |
| 1 point by typpo on Dec 15, 2023 | past |
|
| | Llama 2 uncensored vs. GPT3.5 benchmarking (promptfoo.dev) |
| 3 points by mchiang on Aug 11, 2023 | past |
|
| | How to benchmark Llama2 Uncensored vs. GPT-3.5 on your own inputs (promptfoo.dev) |
| 16 points by typpo on Aug 10, 2023 | past |
|
| | Benchmark Llama 2 vs. GPT on your own data (promptfoo.dev) |
| 1 point by typpo on July 24, 2023 | past |
|