363
Points
147
Comments
Wirbelwind
Author

Top Comments

xg15May 28
This is amazing!

Currently you can "cheat" by simply denying all requests as quickly as possible. This will give you the "security-conscious engineer" badge and a perfect score in terms of how many requests were processed. (You will get the "overblock" notification, but it's somewhat tucked away at the bottom and the screen still looks as if you won)

I also tried to play as the hustle4lyfe move fast and break things engineer and simply approved as many requests as quickly as possible - turns out, the "malicious command" popups actually slow you down. Mean!

spurgelaurelsMay 28
Fun game, but it showed the lack of security hygiene employed by the game writer. It said `cat ~/.zshrc` was bad because it would share tokens and secrets, but I would never put secrets into my shell rc.
socksyMay 28
Weird to make reading zshrc supposed unsafe when I happily publish it in my public dotfiles repo... Who the hell keeps API keys in it? OTOH it seems like lots of these AI tools keep appending PATH in it so I guess there's a fundamental misunderstanding of shell best practices in the entire AI space...

Additionally, killing the results of `lsof` is _not_ safe - if, say, you have the web page open in firefox, or a client subshell in the agent itself, then boom, there goes firefox and the agent.

eranationMay 29
Love it. One nitpick.

>npm config set registry https://npm.internal

>Pointing npm to the company's internal registry mirror as required by onboarding docs

It claimed this is safe and I was 50/50 on it but eventually rejected it.

If this README is for a public / forked repo, and that https://npm.internal is actually https://npm.internal.somethinganexternaldnscanresolve.tld

This can go bad really quickly...

In 99% of cases you would have Artifactory / Nexus (or other mirror) already set by company policy. Having a README tell you to use a different package manager url is a big red flag and seconds away from disaster...

axodMay 28
Fun little game, but I think the questions jump context so much it's a little unrepresentative. It might be better to group things into "packs", which have more real-world representative structure to them. For example, lots of "editing something.js" file permission requests, and then an "npm publish" is far more normal, and it's more of a risk, if you're used to pressing Y lots and then suddenly out of the blue...
orsornaMay 28
About three quarters of the "bad" choices are things that not only do I not care about leaking but things that an employer would not punish you for doing, even if it led to a production incident.
enetherMay 28
The permission thing is a killer to productivity, if you're running Claude I think it's more efficient to just run in a disposable sandbox (like exe.dev[1]) or in some form of docker container with permissions you're personally ok taking the risk with on a personal machine[2]

[1] - https://exe.dev/ is a new cloud provider with some very useful agent UX [2] - I built https://github.com/stanislavkozlovski/dclaude/ for this; not perfect but gets my job done on the rare occassion I need to run the coding agent locally

Visit the Original Link

Read the full content on llmgame.scalex.dev

Source
llmgame.scalex.dev
Author
Wirbelwind
Posted
May 28, 2026 at 01:02 PM


More Top Stories

koenvangilst.nl May 29
Notes from the Mistral AI Now Summit in Paris
11017 commentsby vnglst
Details
owenmcgrann.com May 29
The Dead Economy Theory
180212 commentsby WillDaSilva
Details
inkandswitch.com May 29
Bijou64: A variable-length integer encoding
14154 commentsby justinweiss
Details
techcrunch.com May 29
Robinhood now lets your AI agents trade stocks
1511 commentsby wapasta
Details
rockstarintel.com May 29
GTA 6 Developers Unionize
281145 commentsby AndrewKemendo
Details
jeffgeerling.com May 29
It's hard to justify buying a Framework 12
55124 commentsby watermelon0
Details
👋 Need help with code?