Reader: How We Recommend Content (original) (raw)
The WordPress.com Reader and some of our emails recommend posts and websites based on a number of different algorithms. This guide explains more about how these algorithms work to recommend content.
Why Use Algorithms
We have two goals with the algorithms that we use:
- Help people find websites they want to subscribe and keep up to date with.
- Help website builders and owners to find an audience.
We use and test multiple data sources for building these algorithms. Below we describe where we use your data and how.
Locations Where We Recommend Content
We recommend content in many places and use different algorithms for each:
- Reader Discover suggests recently popular posts based on the tags you are following. If not following any tags, it will default to the
dailyprompt
andWordPress
tags. - Reader Search (for both posts and sites) tries to find the most popular content that matches your search. The results can also be sorted reverse chronologically.
- Reader Post Recommendations (shown on the search page and in the main Reader stream) are made based on what you have recently liked or commented on, and by using collaborative filtering (if you and another user both liked a post, then we are more likely to recommend to posts the other user liked to you).
- Reader “More on WordPress.com” Related Posts (shown at the bottom of all posts in the Reader) is mostly popular content on WordPress.com that is similar to the current post.
- Reader Tag Pages list recent posts that have a particular tag. To prevent spam, posts with more than 15 tags are excluded from the tag pages.
- Related Posts only show posts from within the same site. They are mostly based off of the title and content of the posts, but there is some boosting by likes and comments.
Data Sources
The above algorithms are often being improved, and what content we show depends on a complex combination of factors. Here are examples of the types of information we may use to make our recommendations:
- The title, content, tags, and categories of posts.
- Other text from the site, such as usernames and logins; site names and the host name (support.wordpress.com).
- Total number of likes and comments.
- Who has liked and commented on a post.
- Total number of subscribers.
- Who has subscribed to a site.
- How recently a post was published.
- How often or recently a site has posted.
- The content of what you have liked and commented on.
- Whether posts have links, images, or videos.
- How often a site has been rejected from being recommended in the Reader.
Content we filter out from our algorithms:
- Sites we think may be spam.
- Sites that have mature content.
- Sites with potentially objectionable content.
- Content that is not in your language.
You can view your most recent post likes at https://wordpress.com/activities/likes. When you comment in the Reader or on any website using your WordPress.com account to comment, that comment history will be used in our recommendations.
Commenting on a site with your WordPress.com account will look as in the following comment form. To avoid your comments being included in our algorithms, you can comment anonymously by logging out.