The large multimodal language model GPT-4 is ready for prime time, although, contrary to reports circulating since Friday, it cannot generate video from text.
GPT-4 can, however, accept image and text input and produce text output. Over a range of domains — including documents with text and photographs, diagrams, or screenshots — GPT-4 exhibits similar…