Application Of A BigQuery-based Scoring - MICES

2y ago
27 Views
2 Downloads
2.61 MB
23 Pages
Last View : 1m ago
Last Download : 3m ago
Upload by : Karl Gosselin
Transcription

Application of a BigQuery-based scoringmodel in the search management contextDiego José de Calazans & Georg Wolf

Agenda Introduction Search Management & Search Quality Automatisation as challenge Requirements and goals The collaborative scoring model Search results, pros and cons What‘s next?

Search ManagementSearch Quality as a business goal Sessions with Search: around 30%Search Revenue Share: around 53%Search Conversion Multiplier: 2.6 These and other search related KPIs show positive YoY development (16/17 vs 17/18) We definitely aim for customer relevance

Search ManagementWhat is relevance for a consumer electronics e-shop?There is definitely a lot more to consider than simply keyword matching: Assortment Issues (EOL / alternatives / accessories): “samsung galaxy s5”Inventory turnover rate / multi-channel dependencies: “tv 55 zoll”Margin in consumer electronics: “hp 301”. All in all the aim of search management is to find a “sweet spot” between customer relevance andbusiness goals that should be realized through the search.

nDCGThe NDCG expresses the similarity of an actual ranking to the ideal ranking of a list Bandwidth chosen by tester based on product know-how / plausibilityScore on product & position levelObjectivity given through clear criteria for scoring nDCG in TOP 100 about 98%https://en.wikipedia.org/wiki/Discounted cumulative gain

“Wisdom of the crowd” precision“Matching” and “Ranking” as objective criteria to be judged by testersTOP 4000https://en.wikipedia.org/wiki/Wisdom of the crowd

Search Managementscope & limitations Short-head query area “Grenze des Wahnsinns” Indirect search optimisationSegment Incursion Long tail queries ( n words)Semantic queries -Price range: hereProduct with feature: hereHigh manual effort-TestingDocumentationOptimisationReporting

Search ManagementAutomatisation as challenge / InspirationsASO - Automatic Search Optimisation clicks, carts and purchases after search are registered via events and articles are globally re-ranked

Search ManagementAutomatisation as challenge / InspirationsBigQuery scoring model Dashboard (v1) clicks, carts and purchases after search are registered via events and articles are re-ranked perqueryPrice segment also taken into account for overall scoring

In a nutshell. We aim for customer relevance (keyword matching) . but there is a lot more to consider (relevance) We have running models/processes that give a good overview overshort-head query area (nDCG / wisdom of the crowd) but that is archived with significant manual effort Automatisation is a challenge Understanding and managing long-tail query area betterSorting of true positives inside search result

Requirements and goalsThe ideally ranked search result list Displays relevant products in relation to the searchquery from the average user’s point of view Assesses product relevance by the inherent valuewhich is untainted by short-term events Is able to improve towards a best possible positionindependent from a good or bad starting point

The collaborative scoring modelLet’s assume that. We sell 10 different products (A, B, C, L ) They can be found by entering the search query “XY” How do we define what’s the most relevant product?

What‘s important for our customers?

But we do not only evaluate what products our customers arelooking at in detail Productscore for onesearch queryDetail viewAdd-to-cartPurchaseDetail / detail viewspurchases/ add-to-cartspurchases/ purchases

It is important to prevent the current search result list topredetermine the new rankingSearch 1: „apple iphone“Search 2: „tv“ PDP view of:„Apple iPhone XS 64GB“Add-to-Cart of:„Apple iPhone XS 64GB“PDP view of: „Samsung TV 55uc643“PDP view of: „Sony TV 49OLED123“PDP view of:„Apple iPhone XS 64GB“Add-to-Cart of:„Apple iPhone XS 64GB“PDP view of: „Samsung TV 55uc643“PDP view of: „Sony TV 49OLED123“

Two types of errors regarding the selected time window canoccurIdealtimeframeMinimum Error isnegatively correlatedto the amount ofavailable data withina defined timeframeTimeframe in days What about short-term or long-term advertising campaigns?

Should I better buy smartphone A or B?We asked thousands of users.Score m m * Log20(ds / m)m:ds:rolling Score MedianDaily ScoreProduct:Product:Apple iPhone XR 64GB BlackApple iPhone 8 64GB Space GrayDaily Score:13.9 kDaily Score:18.1 kDaily Score:4.8 kScore:4.8 kScore:5.1 kDaily Score:3.3 kIntention: rank up products with a high relevance to the search query Effects from advertising campaigns should not influence product score But: long-term changes in price or product popularity should influence the score

Model evaluation 1/3For the generic search query “waschmaschine”Collaborative filteringCurrent search list resultsFucus on user interaction with decreasing relevanceStrong focus on text matching

Model evaluation 2/3For the search query “iphone x”Collaborative filteringCurrent SearchFucus on user interaction with decreasing relevancyStrong focus on text matching, rule based

Model evaluation 3/3Discover product alternatives for discontinued productsHigh-traffic saleslines:MediaMarkt GermanyPrices- S7:- A7:- A6:- S8:- S9:- S10:- P20 lite:325 259 (-20%)214 (-34%)419 ( 29%)526 ( 62%)899 (NEW)229 (-30%)Mid-traffic saleslines:MediaMarkt AustriaPrices- S7:- S8:- A7:- S8 :- Note8:- iPhone 6s:347 419 259 599 499 349 ( 20%)(-25%)( 73%)( 44%)( 0%)

Pros and consPros Up-to-date nDCGs are available every day Less manual work for the nDCG evaluation Higher nDCGs accuracy by taking into account user interactions Product alternatives can be calculated and displayedCons A certain inaccuracy if two search queries regularly occur together A lot of user interaction data is needed to achieve good results

What’s next?Each step in the development of the new Search Engine becomes measurableUntil now: Dashboarding Daily recognition of potentially bad rankingsEasily finding of good product alternatives for discontinued productsNow: Data driven field optimization (new search engine) Recognize false negatives (products) regarding to a certain search queryTest several field configurations with a quality indicationNext: Automated relevance optimization (new search engine) Improve relevance for the long tailIntegrate highly relevant alternative products automaticallyLearn field weights that maximizes the average nDCGnDCGQuery Product Score Field A * weight A Field B * weight B Prod popularity * weights Pp

Thank you!Diego José de Calazanscalazans@media-saturn.comGeorg Wolfwolfg@media-saturn.com

Semantic queries Price range: here Product with feature: here - High manual effort - Testing - Documentation - Optimisation - Reporting. Search Management ASO - Automatic Search Optimisation clicks, carts and purchases after search are

Related Documents:

What is BigQuery? BigQuery is a service provided by Google Cloud Platform, a suite of products & services that includes application hosting, cloud computing, database services, etc on on Google's scalable infrastructure BigQuery is Google's fully managed solution for companies who need

Google BigQuery Google BigQuery is a cloud-based, fully managed, serverless enterprise data warehouse that supports analytics over petabyte-scale data. It delivers high-speed analysis of large data sets as a service. BigQuery scales its use of hardware up or down to maximize performance of each query, adding and removing compute and

The following table maps standard data-warehouse concepts to those in BigQuery: Data warehouse BigQuery Data warehouse The BigQuery service replaces the typical hardware setup for a traditional data warehouse. That is, it serves as a collective home for all analytical data in an organization. Data mart Datasets are collections of tables that .

In this tutorial , I will explain how to integrate Google's BigQuery API with web application. My web application is going to get the data using BigQuery and plot a graph using d3.js and Javascript. Each project on Google Developers Console has a clientID and you need to copy the clientID and put it as a config: var gconfig {

The Solution: Google BigQuery Serverless Enterprise Data Warehouse Google BigQuery is a cloud-based, fully managed, serverless enterprise data warehouse that supports analytics over petabyte-scale data. It delivers high-speed analysis of large data sets while reducing or eliminating investments in onsite infrastructure or database administrators.

Google's BigQuery has some useful datasets bigquery-public-data.github_repos is a favourite of mine There's an httparchive dataset too b

or by using Google Cloud Dataflow. BigQuery provides the ability to connect to federated (external) data sources such as Google Cloud Bigtable, Google Cloud Storage (GCS) and Google Drive. BigQuery has native integrations with many third-party reporting and BI providers, such as Tableau, MicroStrategy and Looker.

of this nature, and includes an example of a migration from another cloud data warehouse to BigQuery. It also outlines some of the important differences between BigQuery and other database solutions, and addresses the most frequently asked questions ab