Submissions from promptfoo.dev

		Building a Security Scanner for LLM Apps (promptfoo.dev)
		7 points by danenania 12 hours ago \| past \| 3 comments
		Will Agents Hack Everything? (promptfoo.dev)
		2 points by mooreds 8 days ago \| past \| discuss
		How to replicate the Claude Code attack with Promptfoo (promptfoo.dev)
		6 points by typpo 26 days ago \| past
		How to replicate the Claude Code attack with Promptfoo (promptfoo.dev)
		2 points by mooreds 28 days ago \| past
		Will agents hack everything? (promptfoo.dev)
		6 points by danenania 32 days ago \| past \| 9 comments
		Evaluating Political Bias in LLMs (promptfoo.dev)
		2 points by hagbard_c 49 days ago \| past
		Promptfoo Raises $18.4M Series A to Build the Definitive AI Security Stack (promptfoo.dev)
		1 point by swyx 89 days ago \| past
		Political-bias benchmark for Grok 4, GPT-4.1, Gemini 2.5 Pro and Claude Opus 4 (promptfoo.dev)
		3 points by dangelosaurus 4 months ago \| past
		Next Generation of Red Teaming for LLM Agents (promptfoo.dev)
		1 point by mooreds 5 months ago \| past
		Promptfoo: Secure your AI from prompt to production (promptfoo.dev)
		2 points by handfuloflight 6 months ago \| past
		Questions censored by DeepSeek (promptfoo.dev)
		384 points by typpo 10 months ago \| past \| 227 comments
		Automated jailbreaking techniques with DALL-E (promptfoo.dev)
		2 points by typpo on July 1, 2024 \| past
		Show HN: Automated red teaming for your LLM app (promptfoo.dev)
		23 points by typpo on June 13, 2024 \| past \| 2 comments
		Iterate on LLMs Faster (promptfoo.dev)
		1 point by gmays on May 28, 2024 \| past
		Benchmark Command R vs. GPT/Claude on your own data (promptfoo.dev)
		2 points by typpo on April 9, 2024 \| past
		DBRX vs. Mixtral vs. GPT: create your own benchmark (promptfoo.dev)
		1 point by typpo on March 31, 2024 \| past
		How to benchmark Gemini vs. GPT with your own data (promptfoo.dev)
		1 point by typpo on Dec 15, 2023 \| past
		Llama 2 uncensored vs. GPT3.5 benchmarking (promptfoo.dev)
		3 points by mchiang on Aug 11, 2023 \| past
		How to benchmark Llama2 Uncensored vs. GPT-3.5 on your own inputs (promptfoo.dev)
		16 points by typpo on Aug 10, 2023 \| past
		Benchmark Llama 2 vs. GPT on your own data (promptfoo.dev)
		1 point by typpo on July 24, 2023 \| past