๐Ÿ™TakoAPI
โ† ์—์ด์ „ํŠธ ๋งˆ์ผ“ํ”Œ๋ ˆ์ด์Šค
์˜คํ”ˆ์†Œ์Šค ํ”„๋กœ์ ํŠธ

VibeSearchBench

VibeBench ์ œ์ž‘

๐Ÿ” The hardest search benchmark in the wild โ€” vague, multi-turn, proactive. 200 long-horizon tasks with persona-driven progressive disclosure, scored by verifiable schema-free knowledge-graph evaluation. No vibes, just triplet F1.

๋ณ„ 928๊ฐœ์…€ํ”„ ํ˜ธ์ŠคํŒ…

์Šคํ‚ฌ

์˜คํ”ˆ์†Œ์Šค ํ”„๋กœ์ ํŠธ์ž…๋‹ˆ๋‹ค โ€” ์ฝ”๋“œ๋ฅผ ์‚ดํŽด๋ณด๊ณ  GitHub์—์„œ ์…€ํ”„ ํ˜ธ์ŠคํŒ…ํ•˜์„ธ์š”.