Vision API product search (which is a feature of Discovery AI for Retail) is connected directly to our arrival service, which processes incoming data from our partners. We then convert that data into products. Because we work with a large number of sellers who often offer the same products, we need to identify whether a product already exists in the database. If it does, we merge the new stock with the existing product. This is where Vision API product search comes into play, as it helps us match identical products based on image similarity.
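To make the merge-or-create decision described above concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the production arrival service: the names `Product`, `Catalog`, `ingest`, and `find_match` are hypothetical, and `find_match` stands in for whatever image-similarity lookup returns the ID of an existing product (or `None` when there is no match).

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Optional


@dataclass
class Product:
    # Hypothetical minimal product record used only for this sketch.
    product_id: str
    image_uri: str
    stock: int = 0


@dataclass
class Catalog:
    # In-memory stand-in for the product database.
    products: Dict[str, Product] = field(default_factory=dict)

    def ingest(
        self,
        incoming: Product,
        find_match: Callable[[Product], Optional[str]],
    ) -> str:
        """Merge stock into a matched product, or create a new product.

        Returns the ID of the product that now holds the incoming stock.
        """
        matched_id = find_match(incoming)
        if matched_id is not None and matched_id in self.products:
            # The product already exists: add the new stock to it.
            self.products[matched_id].stock += incoming.stock
            return matched_id
        # No match: register the incoming item as a new product.
        self.products[incoming.product_id] = incoming
        return incoming.product_id


catalog = Catalog()
# First seller: nothing to match against, so a new product is created.
catalog.ingest(Product("sku-1", "gs://example-bucket/a.jpg", stock=3), lambda p: None)
# Second seller offers the same item: the lookup (assumed) points at sku-1.
catalog.ingest(Product("sku-2", "gs://example-bucket/b.jpg", stock=2), lambda p: "sku-1")
```

Passing the lookup in as a function keeps the merge logic independent of how the match is found, whether by a manual search or by an image-similarity service.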
As mentioned above, we used to only be able to perform manual searches for the same product in order to merge the new stock. Manual operations are, of course, time-consuming, ineffective in meeting demand, and often result in the creation of duplicates. Consequently, a separate post-production process was needed to address those duplicates, which used a lot of resources. All of this resulted in longer processing times, which impacted our time to market and was a factor in lower conversion rates.
Vision AI allows us to enhance the product creation process: whenever we have new products, we run them through our quality gates to ensure that we are offering the best customer experience. Given the number of products we have on the platform, one of our most important factors is product processing time. Cutting processing time by just a few seconds could bring big business gains.
To further this cause, we built our own visual search service (VSS), which acts as a supportive service within the product creation process. VSS does two main things. Its primary job is to index products that are similar to the one coming from our partners. This is done based on pictures of the products. It then synchronizes product images between our product centers. When new products come in (which we call an arrival), VSS uploads each product and its main image to Google Cloud Storage and sends it for indexing with Vision API product search. We then start the manual processing step. At the same time, VSS queries Vision API product search, which uses its pretrained image embeddings to return information about similar products, and displays potential duplicates. Our product center operators can then check whether the new product is a duplicate. Once they confirm it, the products are merged.
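The idea of surfacing potential duplicates from image embeddings can be sketched in a few lines of Python. This is a simplified stand-in, not the Vision API itself: it assumes each indexed product already has an embedding vector, ranks them by cosine similarity to the incoming image, and keeps only candidates above a cutoff for an operator to review. The function names, the `min_score` threshold, and the `top_k` limit are all hypothetical choices for illustration.

```python
import math
from typing import Dict, List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def duplicate_candidates(
    query: Sequence[float],
    index: Dict[str, Sequence[float]],
    min_score: float = 0.9,  # assumed cutoff, would need tuning on real data
    top_k: int = 5,          # assumed number of candidates shown to an operator
) -> List[Tuple[str, float]]:
    """Return up to top_k (product_id, score) pairs, best match first."""
    scored = [(pid, cosine_similarity(query, emb)) for pid, emb in index.items()]
    kept = [(pid, score) for pid, score in scored if score >= min_score]
    kept.sort(key=lambda item: item[1], reverse=True)
    return kept[:top_k]


# Toy two-dimensional embeddings; real image embeddings have many more dimensions.
index = {"shoe-a": [1.0, 0.0], "bag-b": [0.0, 1.0], "shoe-c": [0.9, 0.1]}
candidates = duplicate_candidates([1.0, 0.0], index)
```

In this sketch the function only proposes candidates. The final decision stays with a human reviewer, mirroring the operator confirmation step described above.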