In this paper, we propose Text2Scene, a model that generates various forms of
compositional scene representations from natural language descriptions.…