I ran the "OWL-ViT inference playground" notebook in a locally installed Jupyter all the way to the "Text-conditioned detection" cell. No error was reported, but the interactive functionality does not work. Below the "Text-conditioned detection" cell, the image and the text input box are displayed. However, when I enter a text prompt in the text input box, no predicted bounding boxes are drawn on the image and no other output appears.