Skip to main content

Andrew Ng's Landing AI Introduces Visual Prompting

Andrew Ng's company, Landing AI, introduces Visual Prompting, a technology aimed at simplifying and accelerating the development of computer vision models.
Created on April 26|Last edited on April 26
Prompting has gained popularity as an efficient method in natural language processing (NLP), allowing machine learning models to generate accurate and relevant responses with minimal retraining efforts.
By using carefully designed prompts, developers can leverage the vast knowledge embedded in pre-trained models and fine-tune them for specific tasks without the need for extensive new training data. This not only reduces the computational resources and time spent on training but also broadens access to AI technology.

NLP Ideas

Emerging work now seeks to apply these same principles to the field of computer vision. Visual Prompting, an innovative technique inspired by NLP prompting, focuses on generating results across multiple images using a single, well-crafted visual prompt. This method enables developers to efficiently fine-tune computer vision models for various tasks without extensive training data, thus streamlining the development process and bringing the benefits of AI technology to a wider range of applications and industries.

ChatGPT for Vision

Machine learning scientist Andrew Ng’s company Landing AI has introduced Visual Prompting, a technology aimed at simplifying and accelerating the development of computer vision models. Inspired by generative AI text interfaces like ChatGPT, Visual Prompting allows users to quickly and efficiently label data for computer vision applications. Users specify a visual prompt by painting over object classes they want the system to detect. The algorithm then makes inferences based on the provided prompt, allowing users to refine their models if initial results are unsatisfactory.

Different From SAM

Landing AI's Visual Prompting differs from Meta's SAM in its ability to apply a single prompt to multiple images. While SAM focuses on segmenting just one image, Landing AI's method takes a single example and applies it to various images, making it more efficient and versatile in handling different tasks across multiple images.

The Platform

Visual Prompting is part of LandingLens, Landing AI's flagship product designed to make implementing computer vision accessible. The platform enables users to create, deploy, and scale AI-powered industrial computer-vision applications with greater accuracy and speed. Despite being in beta, Visual Prompting has shown promise in over two-thirds of the 40 use cases analyzed by Landing AI. The company is working to improve the system and plans to collaborate with the community to further develop the technology.
The Introduction:

Tags: ML News
Iterate on AI agents and models faster. Try Weights & Biases today.