Modeling Scenes And Human Activities In Videos