Similar Items: From Controlled to the Wild: Evaluation of Pentesting Agents for the Real-World