Hidden Capabilities and Counterintuitive Limits in Large Language Models