Search Ranking Papers

"Training examples are generated from all YouTube watches (even those embedded on other sites) rather than just watches on the recommendations we produce." The model is predicting watch time per impression, but it is unclear how YouTube constructs its training data (impression data). For example,
- If a video is at the bottom of the recommendation list and user may not scroll down, does this count as an impression?
- If a video is shown as relevant videos of an opened video, does this count as an impression?

Applying Deep Learning To Airbnb Search

They tried multi-task learning that optimizes for both booking and long view, and found out that long views increased by a lot while bookings remained neutral. This multi-task learning can be useful for advertisement modeling, where we would optimize for both clicks and conversions.

Amazon Search: The Joy of Ranking Products

"To manage the size of the training set, we sample unseen examples." An idea inspired by this statement:
- For pairwise formulation, a good idea might be sampling the possibly unseen examples. For example, user makes a click on item at position 5, then items displayed after position 5 may not be seen and can be sampled.

Online Controlled Experiments at Large Scale

“We recently ran a slowdown experiment where we slowed 10% of users by 100msec (milliseconds) and another 10% by 250msec for two weeks. The results showed that performance absolutely matters a lot today: every 100msec improves revenue by 0.6%.”
- Speed wins.

Word2vec algorithm (C implementation / gensim implementation)

syn0: random initialization
syn1neg: zero initialization
for central_word in [pick_a_central_word]:
  for context_word in [context_words_based_on_the_central_word]:
    neu1e = 0
    for d in range(0, negative + 1):
      if d == 0:
        word = context_word, label = 1
      if d > 0:
        word = negative_word, label = 0
      dot_product = syn0[central_word] * syn1neg[word]
      gradient = (label - sigmoid(dot_product)) * alpha
      syn1neg[word] += gradient * syn0[central_word]
      neu1e += gradient * syn1neg[word]
    syn0[central_word] += neu1e

Deep Learning Recommendation Model for Personalization and Recommendation Systems

Model Structure

Model Structure of this specific model (pyton dlrm_s_pytorch.py --arch-sparse-feature-size=16 --arch-mlp-bot="13-512-256-64-16" --arch-mlp-top="512-256-1" --data-generation=dataset --data-set=kaggle --raw-data-file=./dac/train.txt --loss-function=bce --round-targets=True --learning-rate=0.1 --mini-batch-size=128)

FAISS

Speed optimization: k-means clustering, find nearest centroid and check this cluster
Memory optimization: dimension reduction such as PCA, Product Quantization

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

liyinxiao / Ranking_Papers

Labels

Projects that are alternatives of or similar to Ranking Papers

Search Ranking Papers

Great talks on search ranking / recommendation system

My personal notes / ideas on papers