32 year old male from Los Angeles likes surfing, travel, and music
65 year old male is a retired financial planner from Oregon who likes sports (especially golf) and literature (specifically fiction)
26 year old male from San Jose works in the internet industry, likes video games and photography
30 year old female waitress from Boston likes travel, skiing, and eating at nice restaurants
18 year old woman from San Francisco likes music, movies and keeping up with celebrities
43 year old house wife and mother of 3 from Chicago goes to church, likes cooking and keeping up with celebrities
58 year old female science teacher from Cincinnati likes going to the movies and travels in the summer
19 year old male from Austin works as an auto mechanic, likes cars, sports, and working out
41 year old male business executive from New York likes politics and sports
The goal of content cleansing is to ensure we pass the cleanest possible signal into our interest graph service. It is a multiple step process that involves:
You can check out the results of the content cleansing phase for your user's 10 articles below. To learn more, head over to our Goose Content Extractor Labs page
Now that we have the data for each article, we run them through our Interest Graphing process.
Now that we've graphed all the articles the user has viewed, we merge them together and prune them to create the user's interest graph.