Holistic Scene Understanding And Goal-Directed Multi-Agent Event Parsing