WO2012075335A2 - Recommendations based on topic clusters - Google Patents

Recommendations based on topic clusters Download PDF

Info

Publication number
WO2012075335A2
WO2012075335A2 PCT/US2011/062956 US2011062956W WO2012075335A2 WO 2012075335 A2 WO2012075335 A2 WO 2012075335A2 US 2011062956 W US2011062956 W US 2011062956W WO 2012075335 A2 WO2012075335 A2 WO 2012075335A2
Authority
WO
WIPO (PCT)
Prior art keywords
topics
user
profile
topic
interaction
Prior art date
Application number
PCT/US2011/062956
Other languages
French (fr)
Other versions
WO2012075335A3 (en
Inventor
Benjamin Liebald
Palash Nandy
Dasarathi Sampath
Junning Liu
Ye NIU
Christina Ilvento
Yu-To Chen
Jamie Davidson
Original Assignee
Google Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Google Inc. filed Critical Google Inc.
Priority to EP11845853.8A priority Critical patent/EP2646964A4/en
Priority to CN201180065617.6A priority patent/CN103329151B/en
Publication of WO2012075335A2 publication Critical patent/WO2012075335A2/en
Publication of WO2012075335A3 publication Critical patent/WO2012075335A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/335Filtering based on additional data, e.g. user or group profiles
    • G06F16/337Profile generation, learning or modification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/16Combinations of two or more digital computers each having at least an arithmetic unit, a program unit and a register, e.g. for a simultaneous processing of several programs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • G06F16/285Clustering or classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/43Querying
    • G06F16/435Filtering based on additional data, e.g. user or group profiles
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/7867Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using information manually generated, e.g. tags, keywords, comments, title and artist information, manually generated time, location and usage information, user ratings
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0481Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
    • G06F3/0482Interaction with lists of selectable items, e.g. menus
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/80Responding to QoS
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/63Querying
    • G06F16/635Filtering based on additional data, e.g. user or group profiles
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/73Querying
    • G06F16/735Filtering based on additional data, e.g. user or group profiles

Definitions

  • the disclosure generally relates to creating and storing user profiles based on content consumption.
  • Content hosting services generally attempt to present content that is generally of interest to its users.
  • Some content hosting services allow users to create user profiles that indicate demographic information (e.g., gender, age), as well as areas of interests or content topics. The content hosting service then attempts to use such profiles to select content to provide to the users.
  • demographic information e.g., gender, age
  • the content hosting service attempts to use such profiles to select content to provide to the users.
  • the users may not be able to articulate all their interests while populating their profile. Additionally, users' interests typically change over time and the users may not update their profiles to reflect these changes.
  • a user's profile is created based on the user's interaction with content items in a content hosting service.
  • a user's interactions with the content items on the content hosting service are recorded.
  • a user analysis module determines topics associated with the content items with which the user has interacted. The user analysis module then selects the topics for the user's profiles based recorded interactions and the associated topics.
  • a user profile is created which represents the selected topics.
  • the topics associated with the content items have associated topic strengths and the user analysis module selects the topics for user's profiles based on the topic strengths.
  • the user's interactions with various content items have associated interaction strengths and the user analysis module selects the topics for user's profiles based on the associated interaction strengths, and stores the topic association strengths for the selected topics in the user profile.
  • topics in the user profile are mapped to clusters of topics, and the mapped cluster topics replace or accompany the user topics in the user profile.
  • Various user-cluster communities are formed that include users whose profiles have a common topic cluster. Recommendations to these user- cluster communities can be made based on the user interactions of some of the users in a community.
  • FIG. 1 illustrates a system for determining and storing the users' profile including their areas of interest according to one embodiment.
  • FIG. 2 is a flow diagram illustrating a method for determining and storing the users' profile including their areas of interest according to one
  • Fig. 3 is a block diagram illustrating the user analysis module that determines and stores the user profiles according to one embodiment.
  • Fig. 4 is a screen illustrating an interface for receiving users' areas of interests for storage in their profiles according to one embodiment.
  • Fig. 5 illustrates a co-occurrence matrix that stores co-occurrence strengths indicating the measure of co-occurrence of a first topic with another topic according to one embodiment.
  • Fig. 6 illustrates a method for providing recommendations based on the users' interactions according to one embodiment.
  • the computing environment described herein enables determination and storage of user profiles that represent, for each user, a set of topics indicative of the user's interests, based on the user's interaction with content items.
  • FIG. 1 illustrates a system for determining and storing user profiles.
  • a video hosting service 100 includes a front end web server 140, a video serving module 110, a video database 155, a user analysis module 120, a user access log 160, a topic repository 164 and a profile repository 166.
  • Video hosting service 100 is connected to a network 180.
  • FIG. 1 also includes a client 170 and third-party service 175 having an embedded video 178.
  • www.youtube.com; other video hosting services are known as well, and can be adapted to operate according to the teaching disclosed here.
  • service represents any computer system adapted to serve content using any internetworking protocols, and is not intended to be limited to content uploaded or downloaded via the Internet or the HTTP protocol. In general, functions described in one
  • the servers and modules described herein are implemented as computer programs executing on server-class computer comprising a CPU, memory, network interface, peripheral interfaces, and other well known components.
  • the computers themselves in some embodiments run a conventional proprietary or open- source operating system such as Microsoft Windows, Mac OS, Linux, etc., have generally high performance CPUs, gigabytes or more of memory, and gigabytes, terabytes, or more of disk storage.
  • a client 170 connect to the front end server 140 via network 180, which is typically the internet, but can also be any network, including but not limited to any combination of a LAN, a MAN, a WAN, a mobile, wired or wireless network, a private network, or a virtual private network. While only a single client 170 is shown, it is understood that very large numbers (e.g., millions) of clients can be supported and can be in communication with the video hosting service 100 at any time.
  • the client 170 may include a variety of different computing devices.
  • client devices 170 are personal computers, digital assistants, personal digital assistants, cellular phones, mobile phones, smart phones or laptop computers. As will be clear to one of ordinary skill in the art, the present invention is not limited to the devices listed above.
  • the client includes a browser or a dedicated application that allows client 170 to present content provided on the video hosting service 100.
  • Suitable applications include, for example, Microsoft Internet Explorer, Netscape Navigator, Mozilla Firefox, Apple Safari, and Google Chrome.
  • the browser can also include or a support a plug-in for a video player (e.g., FlashTM from Adobe Systems, Inc.), or any other player adapted for the video file formats used in the video hosting service 100.
  • videos can be accessed by a standalone program separate from the browser.
  • the digital content items can include, for example, video, audio or a combination of video and audio.
  • a digital content item may be a still image, such as a JPEG or GIF file or a text file.
  • the digital content items will be referred to as a "video,” “video files,” or “video items,” but no limitation on the type of digital content items are indented by this terminology.
  • Other suitable types of digital content items include audio files (e.g. music, podcasts, audio books, and the like), documents, images, multimedia presentations, and so forth.
  • the video hosting service 100 provides videos that have been uploaded by other users of the video hosting service 100, or may have been provided by the video hosting service operator, or by third parties.
  • Clients 170 can search for videos based on keywords or other metadata. These requests are received as queries by the front end server 140 and provided to the video serving module 1 10, which is responsible for searching the video database 155 for videos that satisfy the user queries and providing the videos to the users.
  • the video serving module 110 supports searching on any fielded data for a video, including its title, description, metadata, author, category and so forth.
  • users can browse a list of videos based on categories such as most viewed videos, sports, animals, or automobiles. For example, the user may browse a list of videos related to cars and select which videos from the list to view.
  • Video database 155 stores videos provided to clients 170. Each video in one embodiment has a video identifier (id). Each video file has associated metadata associated that includes video ID, author, title, description, and keywords, additional metadata can be included as available.
  • the metadata also includes one or more topics that are associated with the video.
  • the associated topics may include topics created by a community in a collaborative knowledge base like Freebase. Alternatively, the topics may be selected from the frequently occurring topics occurring in the titles, descriptions, and user comments of the videos, for example the 100,000 most frequently occurring term unigrams or bigrams.
  • each topic is associated with a topic strength TS representing the topics' degree of association with the video.
  • the topic strength for a particular topic and video is based on content analysis of the video, users' comments for the video, or other metadata associated with the video.
  • the topics and topic strength information can be stored in a separate database.
  • the topic strength for a video is also adjusted based on the usefulness of a topic.
  • the usefulness of a topic is a weight reflecting how useful is a topic to a system in representing the topic's association with the video. For example, the system operator may not prefer topics that represent racy or objectionable content and therefore the usefulness weight for such topics may be a low or a negative value.
  • the usefulness of a topic is based on the frequency of topic in the corpus.
  • the user access log 160 stores access data describing the user's access and interactions with videos.
  • the access data indicates whether a user watched an entire video, watched a video for a particular duration, skipped a video, scrolled up or down through a web page including a video, shared a video with other users, added a video to a playlist, flagged a video, blocked a video from a playlist or a collection of videos, favorited a video, gave a video a favorable rating (e.g. liked a video using a FACEBOOKTM account or +1 'd a video using a GOOGLE+TM account), gave a video an unfavorable rating (e.g. "thumbs down").
  • the user access log 160 or another entity associated with the user access log 160 provides the users with the opportunity to opt-out of having the users' access data collected and/or shared with other modules in the video hosting service 100 or other services.
  • the profile repository 164 stores the user profiles.
  • a user profile includes a set of topics for a user. This set of topics represents the user's interest and the list may be partly populated by receiving a number of topics from the user.
  • the user profile may include the topics as a list of topics (e.g., as terms or topic identifiers), or as vector (e.g., bit map, or vector of real valued weights).
  • the topics stored in a user's profile can be used for various purposes.
  • the topics can be displayed as user's area of interest on the user's home page in a social network or a content hosting network.
  • the topics may be used to suggest to the user content, content channels, products, services, additional topics etc. that may be of interest to the user.
  • the suggestions may be provided to the user on the user's home page or another web page like a "browse" page where a user may browse through various topics that may be of interest to the user.
  • the topics displayed on the user's home page or browse page are selectable (for e.g. through a hyperlink).
  • a user may select a topic and the selection leads the user to a web page partly or wholly dedicated to the selected topic.
  • the selected topic's web page includes content related to the selected topic, like related multimedia content or textual content. Additionally, the topic's web page may include links to other related topics' web pages. These related topics may be displayed as topics related to the selected topic or recommended topics for a user visiting the selected topic's web page.
  • the user analysis module 120 determines and stores a user profile based on the videos accessed by the user, and is one means for performing this function.
  • Fig. 2 illustrates method executed by the user analysis module 120 for determining and storing the topics for a user profile.
  • the user analysis module 120 queries the user access log 160 and determines 202 videos accessed by the user. This set of videos can be all videos accessed by the user, or just those accessed by the user within a certain time period, such as the previous thirty days.
  • the user analysis module 120 analyzes the user's access data stored in the user access log 160 and determines 204 the user's interactions with the accessed videos. The user analysis module 120 also determines 204 the user's interaction strength for each accessed video based on factors like the type of user's interaction with the accessed video. The user analysis module 120 also queries the video database 155 and determines 206, for each video accessed by the user, the topics associated with the accessed videos and the video's topic strengths indicating the video's degree of association with the topics. Based on the determined interaction strengths and topic strengths, the user analysis module 120 selects 208 and stores 210 topics in the user's profile.
  • the user analysis module 120 also determines and provides recommendations based on the users' interaction with the videos, and is also one means for performing this function.
  • the recommendations can be recommended videos for the user or recommended topics for the videos.
  • the operation of the user analysis module 120 to provide recommendation is further described below with respect to FIG. 6.
  • Fig. 3 is a block diagram illustrating the user analysis module 120 according to one embodiment.
  • the user analysis module 120 comprises a user interaction module 302, an interaction strength module 304, a user profile module 306, a related topics module 308, a topic cluster module 301 and a cluster
  • the user interaction module 302 receives feedback regarding the users' interactions with videos and stores the received feedback as access data in the user access log 160.
  • a module (not shown) in the client 170 (or the service 175) tracks data about the user's interactions (e.g. pause, rewind, fast forward).
  • Additional user's interactions are tracked by a module (not shown) in the video hosting service 100 or at another service like a social networking service. Regardless of where the data is tracked, the data is transmitted to the user interaction module 302.
  • the user interaction module 302 receives the transmitted data and stores the received data in the user access log 160 as access data. Examples of access data stored in access log 160 are described above.
  • the user interaction module 302 repeatedly receives feedback regarding the user's interactions with various videos and updates the access data for the user based on the received feedback.
  • the interaction strength module 304 analyzes the access data for a user and determines an interaction strength IS; indicating a user's degree of association with a particular video v;. To determine the IS value, the interaction strength module 304 assigns different weights to different types of user's interactions with the video.
  • a user starting a video may be assigned a weight of 0.5
  • a user watching at least 80% of the video may be assigned a weight of 0.75
  • a user giving a favorable rating for the video may be assigned a weight of 1.5
  • a user favoriting a video may be assigned a weight of 2.0
  • a user subscribing to a channel of videos associated with the watched video or with the user who uploaded the watched video may be assigned a weight of 5.0.
  • the interaction strength module 304 assigns greater weight to the user's interactions indicating a greater involvement with a video. For example, the interaction strength module 304 assigns a greater weight to a user adding a video to a playlist, or sharing a video with others, than to the user watching the video.
  • the interaction strength module 304 adjusts the weight for a particular interaction based on the frequency or duration of the interaction. For example, the interaction strength module 304 assign a greater weight to a user's view of a particular video if the user has viewed the video a number of times instead of just once or for a ten minute duration instead of thirty seconds. In one embodiment, the interaction strength module 304 normalizes the adjusted weights based on the total number of videos the user has interacted with, the total number of times the user has interacted with the videos, or the total amount of time the user has spent interacting with the videos.
  • the interaction strength module 304 assigns negative or relatively low weights to certain interactions indicating the user's lack of interest in a particular video. For example, skipping a presented video, flagging a video, or blocking a video from a playlist may be assigned a negative weight.
  • the interaction strength module 304 discounts the weight based on their age. For example, the interaction strength module 304 exponentially decays the weight associated with a user interaction based on the amount of time elapsed since the user interaction occurred. Accordingly, a user interaction that occurred recently is assigned a higher weight than a user interaction that occurred at an earlier time.
  • the interaction strength module 304 determines and stores an interaction strength IS indicating the strength of the user's interactions or association with the video.
  • the interaction strength is based on the assigned and adjusted weights. For example, the interaction strength is a sum or product of the assigned and adjusted weights.
  • the user analysis module 120 determines for a user, the videos Vithe user has interacted with (from the user access log 160) and the user's interaction strength IS; for each of these videos (determined by the interaction strength module 304). Also, as described above, the user analysis module 120 determines for each of these videos v;, topics t associated with the video (from the video database 155) and, for each of the associated topic t k , a topic strength TS k indicating the topic's degree of association with the video (from the video database 155).
  • the user profile module 306 determines the set T of topics for a user profile based on a topic association strength TAS determined for each set s, where TAS j indicates the degree of association between set s s topics t and the user.
  • TAS j indicates the degree of association between set s s topics t and the user.
  • the user profile module 306 combines the topic strengths TS k of the set's topics t k for each of the videos v; in the set Sj . Combining the topic strengths TSs may occur by adding, averaging, or applying another arithmetic or statistical function to the topic strengths TSs.
  • the user profile module 306 selects a number of these sets based on the sets topic association strengths TAS j . For example, the user association module 306 may select fifty sets s with fifty highest topic association strengths TAS. The topics t k of the selected sets s form the set T topics for the user's profile.
  • the user profile module 306 also stores in the user's profile the topic association strengths TAS associated with the stored topics.
  • the user profile module 306 can be configured to periodically updates the stored topics in a user's profile using the process described above, based on the videos that the user interacted with since a prior update.
  • the user profile module 306 receives topics that are related with the topics stored in a user profile and stores the related topics in the user profile.
  • the user profile module 306 receives the related topics from the related topics module 308.
  • Related topics module 308 accesses the topics in a user's profile and determines additional topics related to the profile's topics.
  • related topics module 308 can determine related topics. These include a demographic approach, a topic cooccurrence approach, and a combined demographic and topic co-occurrence approach. Additional approaches to determine related topics would be apparent to one of ordinary skill in the art in light of the disclosure herein. For example, related topics may also be determined based on topics' relationships specified in a knowledgebase like Freebase.
  • related topics module 308 determines related topics based on the popularity of various topics in each of a number of demographic groups.
  • the related topics module 308 organizes the user profiles in the profile corpus based on one or more demographic category, such as gender and age group.
  • the related topics module 308 can organize the user profiles into twelve demographic groups D z of profiles based on the user's gender (male, female) and age group (e.g., 13-17, 18-24, 25-34, 35-44, 45-54; 55+).
  • the related topics module 308 determines, for each demographic group D z of user profiles, a number of most frequently occurring topics t (e.g., the top 50 most frequently occurring topics); this forms the related topic set R z for the demographic group D z . Then for a given demographic group D z , the related topics module 308 adds the related topics R z to each user profile in D z . If a topic t in R z is already present in the user profile, then it can be handled either by skipping it, or by increasing its topic association strength TAS.
  • the related topics module 308 uses the cooccurrence of topics in the user profiles to determine which topics are related to each other. To determine the related topics, the related topics module 308 initially determines, across a collection of user profiles (e.g., all user profiles in the system), pairs of topics (tj, tj) that co-occur in at least some of the user profiles in the collection, and from there determines a measure of co-occurrence for each topic pair. The determination of these co-occurring topics is described in regards to Fig. 5 below. The related topics module 308 then determines for each topic t k in the corpus, the most closely related topics ti based on the co-occurrence measure. Next, given a user profile with topics t j , the related topics module 308 adds to the user profile for each topic t j the most closely related topics ti.
  • the related topics module 308 adds to the user profile for each topic t j the most closely related topics ti.
  • Fig. 5 illustrates a co-occurrence matrix 500 that stores co-occurrence strengths CSi j indicating the measure of co-occurrence of a topic with another topic t j .
  • the illustrated co-occurrence matrix 500 is simply a graphical representation of co-occurrence strengths CSs used to aid the description of the related topics module 308, and that the matrix 500 may be stored in various data structures like arrays, lists etc.
  • the cooccurrence matrix 500 is an nXn matrix.
  • Each row 502a-n represents a topic t; and each column 504a-n represents a topic tj.
  • Each cell, like cell 508 represents the cooccurrence strength CSi j of for the pair of topics t; and t j .
  • the co-occurrence strength CSi j for the pair of topics (tj, t j ) may be determined as follows. As noted above, each topic t; in user profile has a topic association strength TAS;. Thus, for a pair of topics t; and t j co-occurring in a given user profile, the related topics module 308 computes a profile co-occurrence strength PCSi j based on the topic association strengths TAS; and TAS j .
  • the profile cooccurrence strength PCSi j may be a product, sum, average, or another arithmetic or statistical function of the pair's topic association strengths TAS; and TAS j .
  • the cooccurrence strength CSi j is then the combined PCSi j summed across all user profiles in which topics t; and t j co-occur. Each PCSi j is then normalized by the frequency of topic tj in the profile corpus. In other embodiments, combining may include averaging, adding, or performing another arithmetic or statistical function on the profile co-occurrence strengths PCS.
  • cell 508 includes the cooccurrence strength (CS) for topic t; (topic for intersecting row 502i) co-occurring with topic t j (topic for intersecting column 504j) in the profile corpus used to select topics for the co-occurrence matrix 500.
  • This co-occurrence strength (CS) is a normalized sum of topic association strengths (TASs) of t; and t j for corpus' profiles that include both these topics.
  • the sum of the topic association strengths (TASs) has been normalized by the frequency of tj's appearance in corpus' profiles.
  • cell 506 includes the co-occurrence strength (CS) for topic t j co-occurring with topic tj.
  • This co-occurrence strength (CS) is also a normalized sum of topic association strengths (TASs) of t; and t j , but this sum has been normalized by the frequency of t j 's, not ti's, appearance in the corpus' profiles.
  • TASs topic association strengths
  • the related topics module 308 After populating the co-occurrence matrix 500, the related topics module 308 identifies for each topic t; (by row) a number of cells with the highest co-occurrence strengths CSs (e.g., 50 highest values), or the cells with co-occurrence strengths CS beyond a threshold value (e.g., CSi j > 75% of maximum CS ). These cells represent the set of topics Ri that are determined to be related to topic tj.
  • CSs co-occurrence strengths
  • FIG. 5 further illustrates the method employed by the related topics module 308 to select related topics for topic T j .
  • cells 508, 510 include the highest co-occurrence strengths CSi j for topic t j (represented by row 502j).
  • the related topics module 308 identifies these cells 506, 508 as the cells with the highest co-occurrence strengths CSi j and thus identifies topics t; and t n (the topics of the intersecting columns 504i, 504n for cells 506, 508) as topics related to topic t j .
  • the related profile module 308 adds the related topics R j to the user profile. If a topic t in Ri is already present in the user profile, then it can be handled either by skipping it, or by increasing its topic association strength TAS.
  • the related topics module 308 determines related topics for a selected user from a profile corpus of users that are in same demographic group as the selected user. To determine these related topics, the related topics module 308 constructs for each demographic group D z a co-occurrence matrix 500 from a set of user profiles belonging to that group. Then for each demographic group D z , the related topics module 308 determines the related topics R z ,i for each topic; in that that group's co-occurrence matrix.
  • the related topics module 308 automatically adds related topics to each user's profile.
  • the related topics module 308 can be configured to enable users to selectively add related topics to their individual user profiles.
  • the users may add topics, including related topics, to their own profiles through an interface such as the one illustrated in Fig. 4.
  • the interface in Fig. 4 includes a profile topics column 406 and a related topics column 410.
  • the profile topics column 406 includes the topics 412 associated with a user's profile based on the analysis of the user's interactions with videos.
  • the related topics column 410 is updated to include topics 422a-n related to the selected topics 412.
  • the related topics 422a-n are determined by the related topics module 308 and presented to the user in the related topics column 410.
  • the user may select one or more related topics 422a-n, and in response to such selection, these topics are added to the user's profiles.
  • the user profile module 306 also determines and stores with the additional topics their topic association strengths TAS.
  • Fig. 6 illustrates a method executed by the user analysis module 120 for providing recommendations based on the user's interactions.
  • the user analysis module 120 determines 602 related topics based on the user profiles, using any of the previously described methods (demographic, co-occurrence, or demographic cooccurrence).
  • the user analysis module 120 After determining the related topics, the user analysis module 120 creates 604 topic clusters with related topics. The creation of topic clusters is further described in the next section, Creating Topic Clusters.
  • the user analysis module 120 associates 606 various users to the created topic clusters based on the topics in the topic clusters and the user profiles.
  • the association of users to topic clusters is further described below under the heading Associating Users with Clusters.
  • the user analysis module 120 then monitors the activity of users associated with a cluster and determines 608 a recommendation, like a video for the associated users, based on the monitored activity.
  • the user analysis module 120 provides 610 the recommendation for display to the users. The manner of making the recommendations is further described below under the heading
  • the user profiles module 306 stores topic clusters, in addition to, or instead of, the topics in the user profiles.
  • the topic clusters include a set of related topics.
  • the user profiles module 306 receives the topic clusters from the topic clusters module 310.
  • the topic cluster module 310 creates topics cluster T including topics occurring in user profiles.
  • the topic cluster module 310 may create topics clusters based on clustering algorithms like hierarchical agglomerative clustering (HAC), probabilistic models like Latent Dirichlet Allocation (LDA), or vector models, such as k-means (using rows in the co-occurrence matrix as topic vectors).
  • HAC hierarchical agglomerative clustering
  • LDA Latent Dirichlet Allocation
  • vector models such as k-means (using rows in the co-occurrence matrix as topic vectors).
  • the topic cluster module 310 clusters topics from the cooccurrence matrix 500 using HAC.
  • the co-occurrence matrix 500 stores cooccurrence strengths CSi j for pairs of topics (tj, tj) that co-occur in at least some of the user profiles in a collection of user profiles.
  • the topic cluster module 310 identifies the cell in the cooccurrence matrix 500 with the highest co-occurrence strength CSi j . After identifying the cell with the highest co-occurrence strength CSi j , the topic cluster module 310 clusters the topics (3 ⁇ 4, tj) associated with the identified strength CSi j .
  • the topic cluster module 310 determines the co-occurring topics (3 ⁇ 4, t j ) associated with the identified cell and the rows and columns associated with the determined co-occurring topics. Assume for illustration purposes that the topic cluster module 310 identifies cell 506 in cooccurrence matrix 500 as the cell with the highest co-occurrence strength CSi j . The topic cluster module 310 determines that co-occurring topics 3 ⁇ 4 and t j are associated with the identified cell 506 and combines the two topics into a cluster.
  • the topic cluster module 310 To combine the two topics (3 ⁇ 4, t j ), the topic cluster module 310 combines cells in one row with the adjacent cells in the other to get a combined row. For example, to combine rows 502i and 502j, the topic cluster module 310 combines cell 506 with cell 507, cell 508 with cell 509, and so on to get a combined row 502i- j. Similarly, the topic cluster module 310 combines cells in one column with the adjacent cells in the other to get a combined column. To combine two cells, the topic cluster module 310 combines the co-occurrence strengths CSi j of the two cells into cluster co-occurrence strengths CCSi_ j , k .
  • the cluster co-occurrence strengths CCSi_ j , k indicate the measure of co-occurrence of the cluster (including topics 3 ⁇ 4 and t j ) with another topic t k .
  • the topic cluster module 310 combines the co-occurrence strengths CSi j into cluster co-occurrence strength CCSi_ j , k by adding multiplying, averaging, or applying another arithmetic or statistical function on the co-occurrence strengths CSi j .
  • the topic cluster module 310 also normalizes the cluster co-occurrence strength CCSi_ j , k based on a factor like the frequency of the combined topics in the user profile collection.
  • the combination of the cells in the identified rows and columns leads to a new co-occurrence matrix (not shown) that includes n-1 topics wherein one of these topics is a cluster c including the combined topics t; and t j .
  • the topic cluster module 310 then repeats the step of identifying the cell in the new cooccurrence matrix with the highest co-occurrence strength CSi j and clustering the topics or clusters associated with the identified cell.
  • the identified cell may be associated with two topics, a topic and an already formed cluster, or two clusters.
  • the topic cluster module 310 keeps repeating this process of clustering until a termination condition is reached.
  • the termination condition can be a threshold number of clusters of a threshold size (in number of topics or number of videos), or the resulting cluster co-occurrence strength CCSsi_ j , k for the updated cluster does not fall below a threshold.
  • the topic cluster module 310 stores the clusters with their combined topics and cluster co-occurrence strengths CCSsi_ j , k .
  • the user profile module 306 stores topic clusters instead of, or in addition to, topics in the user profiles. To determine topic clusters c for a user's profile, the user profile module 306 identifies, for each topic in the user profile, the cluster cto which that topic belongs, adds that cluster c to a list of clusters for the user profile. The result is a user-cluster profile C comprising a plurality of clusters c.
  • the user profile module 306 determines a user cluster strength
  • the user cluster strength UCSc for an identified cluster c is the sum of the topic association strengths TASi for those topics of the user profile that are in that cluster c.
  • the user cluster strength UCS C for an identified cluster c is the weighted sum of the topic association strengths TASi for those topics of the user profile that are in that cluster c.
  • the weight for each of the topic association strengths is the cluster co-occurrence strength CCS; for the identified cluster.
  • the user profile module 306 performs other mathematical or statistical functions on the topic association strengths TASi, and cluster co-occurrence strength CCSi_to achieve the user cluster strength UCS C .
  • the result of this operation is the representation of the user cluster profile C comprising a list (or vector) of cluster strengths.
  • the user profile module 306 selects a threshold number of clusters with highest user cluster strengths, e.g., the 20 clusters c with the highest UCS values.
  • the user profile module 306 selects all clusters c with the user cluster strength UCS beyond a threshold value. The user profile module 306 stores the selected clusters as clusters for the user-cluster profile.
  • the cluster recommendation module 312 determines a
  • the cluster recommendation module 312 selects one or more of the clusters identified in the user-cluster profile of the user. Then the module 312 analyzes the access data for the other users who also have the identified cluster(s) in their respective user- cluster profiles. These users are called a user-cluster community. In this fashion, a user is a member of a plurality of user-cluster communities, each community corresponding to one of the clusters in the user's user-cluster profile.
  • the module 312 determines which video(s) are currently popular with this set of users, for example in terms of user interactions, such as number of views, user ratings, user comments, or other measures of popularity. The module 312 then selects one of more of the most popular videos as recommended videos for the given user. For example, if a threshold number of these users have interacted with a particular video, the cluster recommendation module 312 selects the video as a recommendation. If the given user has already viewed a recommended video, it can be removed from the recommendations, or demoted in the recommendation list.
  • the module 312 can also make recommendations for topics, following a similar process.
  • the module 312 determines which topics are currently popular with the user-cluster community, rather than which specific videos are popular.
  • popularity of a topic is based on the aggregated interaction measure for the videos associated with the topic and viewed by users in the particular user-cluster community.
  • the module 312 can then select one or more topics based on their aggregated interaction measures. For example, if a threshold number of selected users have interacted with videos that have a particular topic, the cluster recommendation module 312 selects the particular topic.
  • the module 312 selects videos therein, based on factors such as popularity in the user-cluster community, recency, topic strength, and so forth. For example, if a threshold number of users in the user-cluster community have interacted with a particular video, the cluster recommendation module 312 recommends that video to the user.
  • user communities are created by clustering the user-cluster profiles of a population of users.
  • the user-cluster profiles are form a sparse vector ("user cluster vector"), including for each cluster c the UCS C as described above, and a zero value for each cluster that does not have one of its topics in the user profile.
  • These user-cluster vectors can then be again clustered using k- means or other vector clustering methods, to form a number of user communities.
  • Each user is then associated with the user communities containing the user's user- cluster vector. From there, recommendations may be made as above, using the user community as the population for identifying popular videos or topics.
  • Certain aspects of the present invention include process steps and instructions described herein in the form of an algorithm. It should be noted that the process steps and instructions of the present invention could be embodied in software, firmware or hardware, and when embodied in software, could be downloaded to reside on and be operated from different platforms used by real time network operating systems.
  • the present invention also relates to an apparatus for performing the operations herein.
  • This apparatus may be specially constructed for the required purposes, or it may comprise a general-purpose computer selectively activated or reconfigured by a computer program stored in the computer.
  • a computer program may be stored in a computer readable storage medium, such as, but is not limited to, any type of disk including floppy disks, optical disks, DVDs, CD-ROMs, magnetic-optical disks, read-only memories (ROMs), random access memories (RAMs), EPROMs, EEPROMs, magnetic or optical cards, application specific integrated circuits (ASICs), or any type of media suitable for storing electronic instructions, and each coupled to a computer system bus.
  • the computers referred to in the specification may include a single processor or may be architectures employing multiple processor designs for increased computing capability.
  • the algorithms and displays presented herein are not inherently related to any particular computer or other apparatus.
  • Various general-purpose systems may also be used with programs in accordance with the teachings herein, or it may prove convenient to construct more specialized apparatus to perform the required method steps. The required structure for a variety of these systems will appear from the description above.
  • the present invention is not described with reference to any particular programming language. It is appreciated that a variety of programming languages may be used to implement the teachings of the present invention as described herein, and any references to specific languages are provided for disclosure of enablement and best mode of the present invention.

Abstract

A system and method for developing a user's profile based on the user's interaction with content items. A module on the client rendering the content items or the service including the content items tracks the user's interactions with the content items and transmits the tracked data to a user analysis module. The user analysis module determines the topics associated with the interacted upon content items. The user analysis module then selects the topics for the user's profiles based on the received tracked data and the associated topics. The selected topics are mapped to topic clusters and the topic clusters are stored in association with the user profile. Recommendations for a user are made based on the topic clusters associated with the user' profile.

Description

RECOMMENDATIONS BASED ON TOPIC CLUSTERS
BACKGROUND
FIELD OF DISCLOSURE
[0001] The disclosure generally relates to creating and storing user profiles based on content consumption.
DESCRIPTION OF THE RELATED ART
[0002] Content hosting services generally attempt to present content that is generally of interest to its users. Some content hosting services allow users to create user profiles that indicate demographic information (e.g., gender, age), as well as areas of interests or content topics. The content hosting service then attempts to use such profiles to select content to provide to the users. However, the users may not be able to articulate all their interests while populating their profile. Additionally, users' interests typically change over time and the users may not update their profiles to reflect these changes.
SUMMARY
[0003] A user's profile is created based on the user's interaction with content items in a content hosting service. A user's interactions with the content items on the content hosting service are recorded. A user analysis module determines topics associated with the content items with which the user has interacted. The user analysis module then selects the topics for the user's profiles based recorded interactions and the associated topics. A user profile is created which represents the selected topics. In one embodiment, the topics associated with the content items have associated topic strengths and the user analysis module selects the topics for user's profiles based on the topic strengths. In another embodiment, the user's interactions with various content items have associated interaction strengths and the user analysis module selects the topics for user's profiles based on the associated interaction strengths, and stores the topic association strengths for the selected topics in the user profile. In one embodiment, topics in the user profile are mapped to clusters of topics, and the mapped cluster topics replace or accompany the user topics in the user profile. Various user-cluster communities are formed that include users whose profiles have a common topic cluster. Recommendations to these user- cluster communities can be made based on the user interactions of some of the users in a community.
[0004] The features and advantages described in the specification are not all inclusive and, in particular, many additional features and advantages will be apparent to one of ordinary skill in the art in view of the drawings, specification, and claims. Moreover, it should be noted that the language used in the specification has been principally selected for readability and instructional purposes, and may not have been selected to delineate or circumscribe the disclosed subject matter.
BRIEF DESCRIPTION OF DRAWINGS
[0005] Fig. 1 illustrates a system for determining and storing the users' profile including their areas of interest according to one embodiment.
[0006] Fig. 2 is a flow diagram illustrating a method for determining and storing the users' profile including their areas of interest according to one
embodiment.
[0007] Fig. 3 is a block diagram illustrating the user analysis module that determines and stores the user profiles according to one embodiment.
[0008] Fig. 4 is a screen illustrating an interface for receiving users' areas of interests for storage in their profiles according to one embodiment.
[0009] Fig. 5 illustrates a co-occurrence matrix that stores co-occurrence strengths indicating the measure of co-occurrence of a first topic with another topic according to one embodiment.
[0010] Fig. 6 illustrates a method for providing recommendations based on the users' interactions according to one embodiment.
DETAILED DESCRIPTION
[0011] The computing environment described herein enables determination and storage of user profiles that represent, for each user, a set of topics indicative of the user's interests, based on the user's interaction with content items. The figures and the following description describe certain embodiments by way of illustration only. One skilled in the art will readily recognize from the following description that alternative embodiments of the structures and methods illustrated herein may be employed without departing from the principles described herein. Reference will now be made in detail to several embodiments, examples of which are illustrated in the accompanying figures. It is noted that wherever practicable similar or like reference numbers may be used in the figures and may indicate similar or like functionality.
SYSTEM ENVIRONMENT
[0012] Fig. 1 illustrates a system for determining and storing user profiles. A video hosting service 100 includes a front end web server 140, a video serving module 110, a video database 155, a user analysis module 120, a user access log 160, a topic repository 164 and a profile repository 166. Video hosting service 100 is connected to a network 180. FIG. 1 also includes a client 170 and third-party service 175 having an embedded video 178.
[0013] Many conventional features, such as firewalls, load balancers, application servers, failover servers, network management tools and so forth are not shown so as not to obscure the features of the system. A suitable service for implementation of the system is the YOUTUBE™ service, found at
www.youtube.com; other video hosting services are known as well, and can be adapted to operate according to the teaching disclosed here. The term "service" represents any computer system adapted to serve content using any internetworking protocols, and is not intended to be limited to content uploaded or downloaded via the Internet or the HTTP protocol. In general, functions described in one
embodiment as being performed on the server side can also be performed on the client side in other embodiments if appropriate. In addition, the functionality attributed to a particular component can be performed by different or multiple components operating together. [0014] The servers and modules described herein are implemented as computer programs executing on server-class computer comprising a CPU, memory, network interface, peripheral interfaces, and other well known components. The computers themselves in some embodiments run a conventional proprietary or open- source operating system such as Microsoft Windows, Mac OS, Linux, etc., have generally high performance CPUs, gigabytes or more of memory, and gigabytes, terabytes, or more of disk storage. Of course, other types of computers can be used, and it is expected that as more powerful computers are developed in the future, they can be configured in accordance with the teachings here. The functionality implemented by any of the elements can be provided from computer program products that are stored in tangible computer readable storage mediums (e.g., RAM, hard disk, or optical/magnetic media).
[0015] A client 170 connect to the front end server 140 via network 180, which is typically the internet, but can also be any network, including but not limited to any combination of a LAN, a MAN, a WAN, a mobile, wired or wireless network, a private network, or a virtual private network. While only a single client 170 is shown, it is understood that very large numbers (e.g., millions) of clients can be supported and can be in communication with the video hosting service 100 at any time. The client 170 may include a variety of different computing devices.
Examples of client devices 170 are personal computers, digital assistants, personal digital assistants, cellular phones, mobile phones, smart phones or laptop computers. As will be clear to one of ordinary skill in the art, the present invention is not limited to the devices listed above.
[0016] The client includes a browser or a dedicated application that allows client 170 to present content provided on the video hosting service 100. Suitable applications include, for example, Microsoft Internet Explorer, Netscape Navigator, Mozilla Firefox, Apple Safari, and Google Chrome. The browser can also include or a support a plug-in for a video player (e.g., Flash™ from Adobe Systems, Inc.), or any other player adapted for the video file formats used in the video hosting service 100. Alternatively, videos can be accessed by a standalone program separate from the browser.
[0017] The digital content items can include, for example, video, audio or a combination of video and audio. Alternatively, a digital content item may be a still image, such as a JPEG or GIF file or a text file. For purposes of convenience and the description of one embodiment, the digital content items will be referred to as a "video," "video files," or "video items," but no limitation on the type of digital content items are indented by this terminology. Other suitable types of digital content items include audio files (e.g. music, podcasts, audio books, and the like), documents, images, multimedia presentations, and so forth.
[0018] The video hosting service 100 provides videos that have been uploaded by other users of the video hosting service 100, or may have been provided by the video hosting service operator, or by third parties. Clients 170 can search for videos based on keywords or other metadata. These requests are received as queries by the front end server 140 and provided to the video serving module 1 10, which is responsible for searching the video database 155 for videos that satisfy the user queries and providing the videos to the users. The video serving module 110 supports searching on any fielded data for a video, including its title, description, metadata, author, category and so forth. Alternatively, users can browse a list of videos based on categories such as most viewed videos, sports, animals, or automobiles. For example, the user may browse a list of videos related to cars and select which videos from the list to view.
[0019] Video database 155 stores videos provided to clients 170. Each video in one embodiment has a video identifier (id). Each video file has associated metadata associated that includes video ID, author, title, description, and keywords, additional metadata can be included as available. The metadata also includes one or more topics that are associated with the video. The associated topics may include topics created by a community in a collaborative knowledge base like Freebase. Alternatively, the topics may be selected from the frequently occurring topics occurring in the titles, descriptions, and user comments of the videos, for example the 100,000 most frequently occurring term unigrams or bigrams.
[0020] In one embodiment, each topic is associated with a topic strength TS representing the topics' degree of association with the video. The topic strength for a particular topic and video is based on content analysis of the video, users' comments for the video, or other metadata associated with the video. Alternatively, instead of being stored with the metadata of each video, the topics and topic strength information can be stored in a separate database. [0021] In one embodiment, the topic strength for a video is also adjusted based on the usefulness of a topic. The usefulness of a topic is a weight reflecting how useful is a topic to a system in representing the topic's association with the video. For example, the system operator may not prefer topics that represent racy or objectionable content and therefore the usefulness weight for such topics may be a low or a negative value. In another example, the usefulness of a topic is based on the frequency of topic in the corpus.
[0022] The user access log 160 stores access data describing the user's access and interactions with videos. The access data indicates whether a user watched an entire video, watched a video for a particular duration, skipped a video, scrolled up or down through a web page including a video, shared a video with other users, added a video to a playlist, flagged a video, blocked a video from a playlist or a collection of videos, favorited a video, gave a video a favorable rating (e.g. liked a video using a FACEBOOK™ account or +1 'd a video using a GOOGLE+™ account), gave a video an unfavorable rating (e.g. "thumbs down"). In one embodiment, the user access log 160 or another entity associated with the user access log 160 provides the users with the opportunity to opt-out of having the users' access data collected and/or shared with other modules in the video hosting service 100 or other services.
[0023] The profile repository 164 stores the user profiles. A user profile includes a set of topics for a user. This set of topics represents the user's interest and the list may be partly populated by receiving a number of topics from the user. The user profile may include the topics as a list of topics (e.g., as terms or topic identifiers), or as vector (e.g., bit map, or vector of real valued weights).
Additionally, the list is populated by the user analysis module 120. The topics stored in a user's profile can be used for various purposes. For example, the topics can be displayed as user's area of interest on the user's home page in a social network or a content hosting network. Additionally, the topics may be used to suggest to the user content, content channels, products, services, additional topics etc. that may be of interest to the user. The suggestions may be provided to the user on the user's home page or another web page like a "browse" page where a user may browse through various topics that may be of interest to the user. [0024] In one embodiment, the topics displayed on the user's home page or browse page are selectable (for e.g. through a hyperlink). A user may select a topic and the selection leads the user to a web page partly or wholly dedicated to the selected topic. The selected topic's web page includes content related to the selected topic, like related multimedia content or textual content. Additionally, the topic's web page may include links to other related topics' web pages. These related topics may be displayed as topics related to the selected topic or recommended topics for a user visiting the selected topic's web page.
[0025] The user analysis module 120 determines and stores a user profile based on the videos accessed by the user, and is one means for performing this function. Fig. 2 illustrates method executed by the user analysis module 120 for determining and storing the topics for a user profile. To determine the topics, the user analysis module 120 queries the user access log 160 and determines 202 videos accessed by the user. This set of videos can be all videos accessed by the user, or just those accessed by the user within a certain time period, such as the previous thirty days.
[0026] The user analysis module 120 analyzes the user's access data stored in the user access log 160 and determines 204 the user's interactions with the accessed videos. The user analysis module 120 also determines 204 the user's interaction strength for each accessed video based on factors like the type of user's interaction with the accessed video. The user analysis module 120 also queries the video database 155 and determines 206, for each video accessed by the user, the topics associated with the accessed videos and the video's topic strengths indicating the video's degree of association with the topics. Based on the determined interaction strengths and topic strengths, the user analysis module 120 selects 208 and stores 210 topics in the user's profile.
[0027] The user analysis module 120 also determines and provides recommendations based on the users' interaction with the videos, and is also one means for performing this function. The recommendations can be recommended videos for the user or recommended topics for the videos. The operation of the user analysis module 120 to provide recommendation is further described below with respect to FIG. 6. [0028] Fig. 3 is a block diagram illustrating the user analysis module 120 according to one embodiment. The user analysis module 120 comprises a user interaction module 302, an interaction strength module 304, a user profile module 306, a related topics module 308, a topic cluster module 301 and a cluster
recommendation module.
[0029] The user interaction module 302 receives feedback regarding the users' interactions with videos and stores the received feedback as access data in the user access log 160. A module (not shown) in the client 170 (or the service 175) tracks data about the user's interactions (e.g. pause, rewind, fast forward).
Additional user's interactions (e.g. the user requesting a video, rating a video, sharing a video) are tracked by a module (not shown) in the video hosting service 100 or at another service like a social networking service. Regardless of where the data is tracked, the data is transmitted to the user interaction module 302. The user interaction module 302 receives the transmitted data and stores the received data in the user access log 160 as access data. Examples of access data stored in access log 160 are described above. The user interaction module 302 repeatedly receives feedback regarding the user's interactions with various videos and updates the access data for the user based on the received feedback.
[0030] The interaction strength module 304 analyzes the access data for a user and determines an interaction strength IS; indicating a user's degree of association with a particular video v;. To determine the IS value, the interaction strength module 304 assigns different weights to different types of user's interactions with the video. For example, a user starting a video may be assigned a weight of 0.5, a user watching at least 80% of the video may be assigned a weight of 0.75, a user giving a favorable rating for the video may be assigned a weight of 1.5, a user favoriting a video may be assigned a weight of 2.0, and a user subscribing to a channel of videos associated with the watched video or with the user who uploaded the watched video may be assigned a weight of 5.0. The interaction strength module 304 assigns greater weight to the user's interactions indicating a greater involvement with a video. For example, the interaction strength module 304 assigns a greater weight to a user adding a video to a playlist, or sharing a video with others, than to the user watching the video. Additionally, the interaction strength module 304 adjusts the weight for a particular interaction based on the frequency or duration of the interaction. For example, the interaction strength module 304 assign a greater weight to a user's view of a particular video if the user has viewed the video a number of times instead of just once or for a ten minute duration instead of thirty seconds. In one embodiment, the interaction strength module 304 normalizes the adjusted weights based on the total number of videos the user has interacted with, the total number of times the user has interacted with the videos, or the total amount of time the user has spent interacting with the videos.
[0031] The interaction strength module 304 assigns negative or relatively low weights to certain interactions indicating the user's lack of interest in a particular video. For example, skipping a presented video, flagging a video, or blocking a video from a playlist may be assigned a negative weight.
[0032] In one embodiment, the interaction strength module 304 discounts the weight based on their age. For example, the interaction strength module 304 exponentially decays the weight associated with a user interaction based on the amount of time elapsed since the user interaction occurred. Accordingly, a user interaction that occurred recently is assigned a higher weight than a user interaction that occurred at an earlier time.
[0033] After assigning and adjusting weights for the user's interactions with a particular video, the interaction strength module 304 determines and stores an interaction strength IS indicating the strength of the user's interactions or association with the video. The interaction strength is based on the assigned and adjusted weights. For example, the interaction strength is a sum or product of the assigned and adjusted weights.
[0034] As described above, the user analysis module 120 determines for a user, the videos Vithe user has interacted with (from the user access log 160) and the user's interaction strength IS; for each of these videos (determined by the interaction strength module 304). Also, as described above, the user analysis module 120 determines for each of these videos v;, topics t associated with the video (from the video database 155) and, for each of the associated topic tk, a topic strength TSk indicating the topic's degree of association with the video (from the video database 155).
[0035] Based on this information, the user profile module 306 determines a set T of topics for a user's profile. To determine the topics T for a user profile, the user profile module 306 sorts the videos v; the user interacted with based on the topics tk associated with the videos. The sort results in sets S= {si, s2, s3.. . Sj } of topics such that each set Sj includes a topic tk and its associated user's videos Vi,k. The user profile module 306 selects a number of the topic sets s, where each selected set has a minimum number of videos, e.g., each selected topic set has at least 20 videos. The topics tk of the selected sets s form the set T topics for the user's profile.
[0036] Alternatively, the user profile module 306 determines the set T of topics for a user profile based on a topic association strength TAS determined for each set s, where TASj indicates the degree of association between set s s topics t and the user. To determine the topic association strength TASj for a particular set Sj of topics tk, the user profile module 306 combines the topic strengths TSk of the set's topics tk for each of the videos v; in the set Sj . Combining the topic strengths TSs may occur by adding, averaging, or applying another arithmetic or statistical function to the topic strengths TSs. After determining the topic association strength TASj for each set Sj in S, the user profile module 306 selects a number of these sets based on the sets topic association strengths TASj. For example, the user association module 306 may select fifty sets s with fifty highest topic association strengths TAS. The topics tk of the selected sets s form the set T topics for the user's profile.
[0037] The user profile module 306 also stores in the user's profile the topic association strengths TAS associated with the stored topics. The user profile module 306 can be configured to periodically updates the stored topics in a user's profile using the process described above, based on the videos that the user interacted with since a prior update.
[0038] Additionally, in one embodiment, the user profile module 306 receives topics that are related with the topics stored in a user profile and stores the related topics in the user profile. The user profile module 306 receives the related topics from the related topics module 308. Related topics module 308 accesses the topics in a user's profile and determines additional topics related to the profile's topics.
[0039] There are several different ways that the related topics module 308 can determine related topics. These include a demographic approach, a topic cooccurrence approach, and a combined demographic and topic co-occurrence approach. Additional approaches to determine related topics would be apparent to one of ordinary skill in the art in light of the disclosure herein. For example, related topics may also be determined based on topics' relationships specified in a knowledgebase like Freebase.
[0040] Related Topics based on Demographics
[0041] In one embodiment, related topics module 308 determines related topics based on the popularity of various topics in each of a number of demographic groups. In this embodiment, the related topics module 308 organizes the user profiles in the profile corpus based on one or more demographic category, such as gender and age group. For example, the related topics module 308 can organize the user profiles into twelve demographic groups Dz of profiles based on the user's gender (male, female) and age group (e.g., 13-17, 18-24, 25-34, 35-44, 45-54; 55+). The related topics module 308 then determines, for each demographic group Dz of user profiles, a number of most frequently occurring topics t (e.g., the top 50 most frequently occurring topics); this forms the related topic set Rz for the demographic group Dz. Then for a given demographic group Dz, the related topics module 308 adds the related topics Rz to each user profile in Dz. If a topic t in Rz is already present in the user profile, then it can be handled either by skipping it, or by increasing its topic association strength TAS.
[0042] Related Topics based on Topic Co-Occurrence
[0043] In another embodiment, the related topics module 308 uses the cooccurrence of topics in the user profiles to determine which topics are related to each other. To determine the related topics, the related topics module 308 initially determines, across a collection of user profiles (e.g., all user profiles in the system), pairs of topics (tj, tj) that co-occur in at least some of the user profiles in the collection, and from there determines a measure of co-occurrence for each topic pair. The determination of these co-occurring topics is described in regards to Fig. 5 below. The related topics module 308 then determines for each topic tk in the corpus, the most closely related topics ti based on the co-occurrence measure. Next, given a user profile with topics tj, the related topics module 308 adds to the user profile for each topic tj the most closely related topics ti.
[0044] Fig. 5 illustrates a co-occurrence matrix 500 that stores co-occurrence strengths CSij indicating the measure of co-occurrence of a topic with another topic tj. One of ordinary skill in the art will understand that the illustrated co-occurrence matrix 500 is simply a graphical representation of co-occurrence strengths CSs used to aid the description of the related topics module 308, and that the matrix 500 may be stored in various data structures like arrays, lists etc. Given n topics t, the cooccurrence matrix 500 is an nXn matrix. Each row 502a-n represents a topic t; and each column 504a-n represents a topic tj. Each cell, like cell 508 represents the cooccurrence strength CSij of for the pair of topics t; and tj.
[0045] The co-occurrence strength CSij for the pair of topics (tj, tj) may be determined as follows. As noted above, each topic t; in user profile has a topic association strength TAS;. Thus, for a pair of topics t; and tj co-occurring in a given user profile, the related topics module 308 computes a profile co-occurrence strength PCSij based on the topic association strengths TAS; and TASj. The profile cooccurrence strength PCSij may be a product, sum, average, or another arithmetic or statistical function of the pair's topic association strengths TAS; and TASj. The cooccurrence strength CSij is then the combined PCSij summed across all user profiles in which topics t; and tj co-occur. Each PCSij is then normalized by the frequency of topic tj in the profile corpus. In other embodiments, combining may include averaging, adding, or performing another arithmetic or statistical function on the profile co-occurrence strengths PCS.
[0046] An example illustrated in Fig. 5 assists in describing the method for computing the co-occurrence strengths (CSs). In Fig. 5, cell 508 includes the cooccurrence strength (CS) for topic t; (topic for intersecting row 502i) co-occurring with topic tj (topic for intersecting column 504j) in the profile corpus used to select topics for the co-occurrence matrix 500. This co-occurrence strength (CS) is a normalized sum of topic association strengths (TASs) of t; and tj for corpus' profiles that include both these topics. The sum of the topic association strengths (TASs) has been normalized by the frequency of tj's appearance in corpus' profiles. Similarly, cell 506 includes the co-occurrence strength (CS) for topic tj co-occurring with topic tj. This co-occurrence strength (CS) is also a normalized sum of topic association strengths (TASs) of t; and tj, but this sum has been normalized by the frequency of tj's, not ti's, appearance in the corpus' profiles.
[0047] After populating the co-occurrence matrix 500, the related topics module 308 identifies for each topic t; (by row) a number of cells with the highest co-occurrence strengths CSs (e.g., 50 highest values), or the cells with co-occurrence strengths CS beyond a threshold value (e.g., CSij > 75% of maximum CS ). These cells represent the set of topics Ri that are determined to be related to topic tj.
[0048] The example illustrated in Fig. 5 further illustrates the method employed by the related topics module 308 to select related topics for topic Tj. In Fig. 5, assume that cells 508, 510 include the highest co-occurrence strengths CSij for topic tj (represented by row 502j). The related topics module 308 identifies these cells 506, 508 as the cells with the highest co-occurrence strengths CSij and thus identifies topics t; and tn (the topics of the intersecting columns 504i, 504n for cells 506, 508) as topics related to topic tj.
[0049] Finally, given a user profile of topics t, for each topic t; therein the related profile module 308 adds the related topics Rj to the user profile. If a topic t in Ri is already present in the user profile, then it can be handled either by skipping it, or by increasing its topic association strength TAS.
[0050] Related Topics based on Demographics and Co-Occurrence
[0051] In one embodiment, the related topics module 308 determines related topics for a selected user from a profile corpus of users that are in same demographic group as the selected user. To determine these related topics, the related topics module 308 constructs for each demographic group Dz a co-occurrence matrix 500 from a set of user profiles belonging to that group. Then for each demographic group Dz, the related topics module 308 determines the related topics Rz,i for each topic; in that that group's co-occurrence matrix.
[0052] User Selected Topics
[0053] In the foregoing embodiments, the related topics module 308 automatically adds related topics to each user's profile. Alternatively, the related topics module 308 can be configured to enable users to selectively add related topics to their individual user profiles. In one embodiment, the users may add topics, including related topics, to their own profiles through an interface such as the one illustrated in Fig. 4. The interface in Fig. 4 includes a profile topics column 406 and a related topics column 410. The profile topics column 406 includes the topics 412 associated with a user's profile based on the analysis of the user's interactions with videos. In response to a user selecting one or more of the topics 412 in the profile topics column 406, the related topics column 410 is updated to include topics 422a-n related to the selected topics 412. The related topics 422a-n are determined by the related topics module 308 and presented to the user in the related topics column 410. The user may select one or more related topics 422a-n, and in response to such selection, these topics are added to the user's profiles. In one embodiment, the user profile module 306 also determines and stores with the additional topics their topic association strengths TAS.
[0054] Recommendations Based on Topic Clusters
[0055] Fig. 6 illustrates a method executed by the user analysis module 120 for providing recommendations based on the user's interactions. The user analysis module 120 determines 602 related topics based on the user profiles, using any of the previously described methods (demographic, co-occurrence, or demographic cooccurrence).
[0056] After determining the related topics, the user analysis module 120 creates 604 topic clusters with related topics. The creation of topic clusters is further described in the next section, Creating Topic Clusters.
[0057] Next, the user analysis module 120 associates 606 various users to the created topic clusters based on the topics in the topic clusters and the user profiles. The association of users to topic clusters is further described below under the heading Associating Users with Clusters.
[0058] The user analysis module 120 then monitors the activity of users associated with a cluster and determines 608 a recommendation, like a video for the associated users, based on the monitored activity. The user analysis module 120 provides 610 the recommendation for display to the users. The manner of making the recommendations is further described below under the heading
Recommendations Based on Clusters.
[0059] Creating Topic Clusters
[0060] In one embodiment, the user profiles module 306 stores topic clusters, in addition to, or instead of, the topics in the user profiles. The topic clusters include a set of related topics. The user profiles module 306 receives the topic clusters from the topic clusters module 310.
[0061] The topic cluster module 310 creates topics cluster T including topics occurring in user profiles. The topic cluster module 310 may create topics clusters based on clustering algorithms like hierarchical agglomerative clustering (HAC), probabilistic models like Latent Dirichlet Allocation (LDA), or vector models, such as k-means (using rows in the co-occurrence matrix as topic vectors). In one embodiment, the topic cluster module 310 clusters topics from the cooccurrence matrix 500 using HAC. Again, the co-occurrence matrix 500 stores cooccurrence strengths CSij for pairs of topics (tj, tj) that co-occur in at least some of the user profiles in a collection of user profiles. To create topics clusters from the co-occurrence matrix 500, the topic cluster module 310 identifies the cell in the cooccurrence matrix 500 with the highest co-occurrence strength CSij. After identifying the cell with the highest co-occurrence strength CSij, the topic cluster module 310 clusters the topics (¾, tj) associated with the identified strength CSij.
[0062] To cluster the associated topics, the topic cluster module 310 determines the co-occurring topics (¾, tj) associated with the identified cell and the rows and columns associated with the determined co-occurring topics. Assume for illustration purposes that the topic cluster module 310 identifies cell 506 in cooccurrence matrix 500 as the cell with the highest co-occurrence strength CSij. The topic cluster module 310 determines that co-occurring topics ¾ and tj are associated with the identified cell 506 and combines the two topics into a cluster.
[0063] To combine the two topics (¾, tj), the topic cluster module 310 combines cells in one row with the adjacent cells in the other to get a combined row. For example, to combine rows 502i and 502j, the topic cluster module 310 combines cell 506 with cell 507, cell 508 with cell 509, and so on to get a combined row 502i- j. Similarly, the topic cluster module 310 combines cells in one column with the adjacent cells in the other to get a combined column. To combine two cells, the topic cluster module 310 combines the co-occurrence strengths CSij of the two cells into cluster co-occurrence strengths CCSi_j,k. The cluster co-occurrence strengths CCSi_j,k indicate the measure of co-occurrence of the cluster (including topics ¾ and tj) with another topic tk. The topic cluster module 310 combines the co-occurrence strengths CSij into cluster co-occurrence strength CCSi_j,k by adding multiplying, averaging, or applying another arithmetic or statistical function on the co-occurrence strengths CSij. In one embodiment, the topic cluster module 310 also normalizes the cluster co-occurrence strength CCSi_j,k based on a factor like the frequency of the combined topics in the user profile collection.
[0064] The combination of the cells in the identified rows and columns leads to a new co-occurrence matrix (not shown) that includes n-1 topics wherein one of these topics is a cluster c including the combined topics t; and tj. The topic cluster module 310 then repeats the step of identifying the cell in the new cooccurrence matrix with the highest co-occurrence strength CSij and clustering the topics or clusters associated with the identified cell. The identified cell may be associated with two topics, a topic and an already formed cluster, or two clusters. The topic cluster module 310 keeps repeating this process of clustering until a termination condition is reached. The termination condition can be a threshold number of clusters of a threshold size (in number of topics or number of videos), or the resulting cluster co-occurrence strength CCSsi_j,k for the updated cluster does not fall below a threshold. After the termination condition is reached, the topic cluster module 310 stores the clusters with their combined topics and cluster co-occurrence strengths CCSsi_j,k.
[0065] Associating Users with Clusters
[0066] As stated above, the user profile module 306 stores topic clusters instead of, or in addition to, topics in the user profiles. To determine topic clusters c for a user's profile, the user profile module 306 identifies, for each topic in the user profile, the cluster cto which that topic belongs, adds that cluster c to a list of clusters for the user profile. The result is a user-cluster profile C comprising a plurality of clusters c.
[0067] The user profile module 306 then determines a user cluster strength
UCSc for each identified cluster c in the user-cluster profile C indicating the degree of association of the identified cluster c with the user. The user cluster strength UCSc for an identified cluster c is the sum of the topic association strengths TASi for those topics of the user profile that are in that cluster c. In one embodiment, the user cluster strength UCSC for an identified cluster c is the weighted sum of the topic association strengths TASi for those topics of the user profile that are in that cluster c. The weight for each of the topic association strengths is the cluster co-occurrence strength CCS; for the identified cluster. In other embodiments, the user profile module 306 performs other mathematical or statistical functions on the topic association strengths TASi, and cluster co-occurrence strength CCSi_to achieve the user cluster strength UCSC. The result of this operation is the representation of the user cluster profile C comprising a list (or vector) of cluster strengths. [0068] Optionally, the user profile module 306 then selects a threshold number of clusters with highest user cluster strengths, e.g., the 20 clusters c with the highest UCS values. In another embodiment, the user profile module 306 selects all clusters c with the user cluster strength UCS beyond a threshold value. The user profile module 306 stores the selected clusters as clusters for the user-cluster profile.
[0069] Recommendations Based on Clusters
[0070] The cluster recommendation module 312 determines a
recommendation of a video or a topic for a user, based on the user-cluster profile, and the interactions of other users with the videos.
[0071] To determine the recommendation of a video to a given user, the cluster recommendation module 312 selects one or more of the clusters identified in the user-cluster profile of the user. Then the module 312 analyzes the access data for the other users who also have the identified cluster(s) in their respective user- cluster profiles. These users are called a user-cluster community. In this fashion, a user is a member of a plurality of user-cluster communities, each community corresponding to one of the clusters in the user's user-cluster profile.
[0072] Given the user -cluster community, the module 312 determines which video(s) are currently popular with this set of users, for example in terms of user interactions, such as number of views, user ratings, user comments, or other measures of popularity. The module 312 then selects one of more of the most popular videos as recommended videos for the given user. For example, if a threshold number of these users have interacted with a particular video, the cluster recommendation module 312 selects the video as a recommendation. If the given user has already viewed a recommended video, it can be removed from the recommendations, or demoted in the recommendation list.
[0073] The module 312 can also make recommendations for topics, following a similar process. Here, the module 312 determines which topics are currently popular with the user-cluster community, rather than which specific videos are popular. Here, popularity of a topic is based on the aggregated interaction measure for the videos associated with the topic and viewed by users in the particular user-cluster community. The module 312 can then select one or more topics based on their aggregated interaction measures. For example, if a threshold number of selected users have interacted with videos that have a particular topic, the cluster recommendation module 312 selects the particular topic.
[0074] From the selected topics, the module 312 then selects videos therein, based on factors such as popularity in the user-cluster community, recency, topic strength, and so forth. For example, if a threshold number of users in the user-cluster community have interacted with a particular video, the cluster recommendation module 312 recommends that video to the user.
[0075] In another embodiment, user communities are created by clustering the user-cluster profiles of a population of users. Here, the user-cluster profiles are form a sparse vector ("user cluster vector"), including for each cluster c the UCSC as described above, and a zero value for each cluster that does not have one of its topics in the user profile. These user-cluster vectors can then be again clustered using k- means or other vector clustering methods, to form a number of user communities. Each user is then associated with the user communities containing the user's user- cluster vector. From there, recommendations may be made as above, using the user community as the population for identifying popular videos or topics.
[0076] The present invention has been described in particular detail with respect to a limited number of embodiments. Those of skill in the art will appreciate that the invention may additionally be practiced in other embodiments.
[0077] Within this written description, the particular naming of the components, capitalization of terms, the attributes, data structures, or any other programming or structural aspect is not mandatory or significant, and the
mechanisms that implement the invention or its features may have different names, formats, or protocols. Further, the system may be implemented via a combination of hardware and software, as described, or entirely in hardware elements. Also, the particular division of functionality between the various system components described herein is merely exemplary, and not mandatory; functions performed by a single system component may instead be performed by multiple components, and functions performed by multiple components may instead be performed by a single
component.
[0078] Some portions of the above description present the feature of the present invention in terms of algorithms and symbolic representations of operations on information. These algorithmic descriptions and representations are the means used by those skilled in the art to most effectively convey the substance of their work to others skilled in the art. These operations, while described functionally or logically, are understood to be implemented by computer programs. Furthermore, it has also proven convenient at times, to refer to these arrangements of operations as modules or code devices, without loss of generality.
[0079] It should be borne in mind, however, that all of these and similar terms are to be associated with the appropriate physical quantities and are merely convenient labels applied to these quantities. Unless specifically stated otherwise as apparent from the present discussion, it is appreciated that throughout the description, discussions utilizing terms such as "selecting" or "computing" or "determining" or the like, refer to the action and processes of a computer system, or similar electronic computing device, that manipulates and transforms data represented as physical (electronic) quantities within the computer system memories or registers or other such information storage, transmission or display devices.
[0080] Certain aspects of the present invention include process steps and instructions described herein in the form of an algorithm. It should be noted that the process steps and instructions of the present invention could be embodied in software, firmware or hardware, and when embodied in software, could be downloaded to reside on and be operated from different platforms used by real time network operating systems.
[0081] The present invention also relates to an apparatus for performing the operations herein. This apparatus may be specially constructed for the required purposes, or it may comprise a general-purpose computer selectively activated or reconfigured by a computer program stored in the computer. Such a computer program may be stored in a computer readable storage medium, such as, but is not limited to, any type of disk including floppy disks, optical disks, DVDs, CD-ROMs, magnetic-optical disks, read-only memories (ROMs), random access memories (RAMs), EPROMs, EEPROMs, magnetic or optical cards, application specific integrated circuits (ASICs), or any type of media suitable for storing electronic instructions, and each coupled to a computer system bus. Furthermore, the computers referred to in the specification may include a single processor or may be architectures employing multiple processor designs for increased computing capability. [0082] The algorithms and displays presented herein are not inherently related to any particular computer or other apparatus. Various general-purpose systems may also be used with programs in accordance with the teachings herein, or it may prove convenient to construct more specialized apparatus to perform the required method steps. The required structure for a variety of these systems will appear from the description above. In addition, the present invention is not described with reference to any particular programming language. It is appreciated that a variety of programming languages may be used to implement the teachings of the present invention as described herein, and any references to specific languages are provided for disclosure of enablement and best mode of the present invention.
[0083] Finally, it should be noted that the language used in the specification has been principally selected for readability and instructional purposes, and may not have been selected to delineate or circumscribe the inventive subject matter.
Accordingly, the disclosure of the present invention is intended to be illustrative, but not limiting, of the scope of the invention.

Claims

WHAT IS CLAIMED IS:
1. A computer-implemented method for developing a profile of a user including topic clusters, the method comprising:
retrieving the profile including a plurality of topics, the plurality of topics indicating interests of the user;
determining a plurality of topic clusters including related topics; identifying, from the plurality of topic clusters, a topic cluster
including a topic indicating an interest of the user; and storing the identified topic cluster in association with the profile of the user.
2. A computer-implemented method for developing a profile of a user, the method comprising:
determining digital content items interacted upon by the user, each of the digital content items associated with a plurality of topics; determining the plurality of associated topics for each of the digital content items;
retrieving access data indicating the user's interactions with the
digital content items;
selecting profile topics based on the topics associated with the digital content items and the retrieved access data;
storing the selected profile topics in association with the user's
profile;
determining additional topics related to the stored profile topics, the additional topics co-occurring with profile topics in additional user profiles; and
storing the additional topics in association with the user's profile.
3. The method of claim 2, wherein the additional topics are determined based on a strength of co-occurrence of the additional topics with at least one of the profile topics.
4. The computer-implemented method of claim 2, wherein the topics associated with each digital content item have topic strengths indicating the topics' degree of association with their content item, and
the profile topics are further selected based on the topic strengths.
5. The computer-implemented method of claim 2, wherein
each of the topics associated with each digital content item have a usefulness weight indicating how useful a topic is in representing its association with the digital content item, and the profile topics are further selected based on the usefulness weights associated with the digital content items' topics.
6. The computer-implemented method of claim 5, wherein the usefulness weight of a topic is based on whether digital content items associated with the topic have objectionable content.
7. The computer-implemented method of claim 5, wherein the usefulness weight of a topic is based on a frequency with which the topic appears in a video corpus.
8. The computer-implemented method of claim 2, wherein
each of the user's interactions is associated with an interaction
strength, and
the profile topics are further selected based on the interaction
strengths associated with the user's interactions.
9. The computer-implemented method of claim 8, wherein the interaction strength of at least one user interaction is based on a frequency of the at least one user interaction.
10. The computer-implemented method of claim 8, wherein the interaction strength of at least one user interaction is based on a duration of the at least one user interaction.
11. The computer-implemented method of claim 8, wherein the interaction strength of at least one user interaction is reduced based on an amount of time elapsed since the at least one user interaction occurred.
12. The computer implemented method of claim 2, further comprising: determining additional topics that co-occur in user profiles with the selected profile topics; and storing the additional topics in association with the user's profile.
13. A computer system for developing a profile of a user, the system comprising a non-transitory computer readable medium storing instructions for:
determining digital content items interacted upon by the user, each of the digital content items associated with a plurality of topics; determining the plurality of associated topics for each of the digital content items;
retrieving access data indicating the user's interactions with the
digital content items;
selecting profile topics based on the topics associated with the digital content items and the retrieved access data;
storing the selected profile topics in association with the user's
profile;
determining additional topics related to the stored profile topics, the additional topics co-occurring with profile topics in additional user profiles; and
storing the additional topics in association with the user's profile.
14. The computer system of claim 13, wherein the additional topics are determined based on a strength of co-occurrence of the additional topics with at least one of the profile topics.
15. The computer system of claim 13, wherein
the topics associated with each digital content item have topic
strengths indicating the topics' degree of association with their content item, and
the profile topics are further selected based on the topic strengths.
16. The computer system of claim 13, wherein
each of the topics associated with each digital content item have a usefulness weight indicating how useful a topic is in representing its association with the digital content item, and the profile topics are further selected based on the usefulness weights associated with the digital content items' topics.
17. The computer system of claim 16, wherein the usefulness weight of a topic is based on whether digital content items associated with the topic have objectionable content.
18. The computer system of claim 16, wherein the usefulness weight of a topic is based on a frequency with which the topic appears in a video corpus.
19. The computer system of claim 13, wherein
each of the user's interactions is associated with an interaction
strength, and
the profile topics are further selected based on the interaction
strengths associated with the user's interactions.
20. The computer system of claim 19, wherein the interaction strength of at least one user interaction is based on a frequency of the at least one user interaction.
21. The computer system of claim 19, wherein the interaction strength of at least one user interaction is based on a duration of the at least one user interaction.
22. The computer system of claim 19, wherein the interaction strength of at least one user interaction is reduced based on an amount of time elapsed since the at least one user interaction occurred.
23. The computer system of claim 13, further comprising instructions for: determining additional topics that co-occur in user profiles with the selected profile topics; and
storing the additional topics in association with the user's profile.
PCT/US2011/062956 2010-12-01 2011-12-01 Recommendations based on topic clusters WO2012075335A2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP11845853.8A EP2646964A4 (en) 2010-12-01 2011-12-01 Recommendations based on topic clusters
CN201180065617.6A CN103329151B (en) 2010-12-01 2011-12-01 Recommendation based on topic cluster

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US41881810P 2010-12-01 2010-12-01
US61/418,818 2010-12-01

Publications (2)

Publication Number Publication Date
WO2012075335A2 true WO2012075335A2 (en) 2012-06-07
WO2012075335A3 WO2012075335A3 (en) 2013-03-21

Family

ID=46163218

Family Applications (2)

Application Number Title Priority Date Filing Date
PCT/US2011/062965 WO2012075341A2 (en) 2010-12-01 2011-12-01 Personal content streams based on user-topic profiles
PCT/US2011/062956 WO2012075335A2 (en) 2010-12-01 2011-12-01 Recommendations based on topic clusters

Family Applications Before (1)

Application Number Title Priority Date Filing Date
PCT/US2011/062965 WO2012075341A2 (en) 2010-12-01 2011-12-01 Personal content streams based on user-topic profiles

Country Status (4)

Country Link
US (6) US8688706B2 (en)
EP (2) EP2646971A4 (en)
CN (2) CN103348342B (en)
WO (2) WO2012075341A2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9473745B2 (en) 2014-01-30 2016-10-18 Google Inc. System and method for providing live imagery associated with map locations

Families Citing this family (103)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9477672B2 (en) 2009-12-02 2016-10-25 Gartner, Inc. Implicit profile for use with recommendation engine and/or question router
JP2011138197A (en) * 2009-12-25 2011-07-14 Sony Corp Information processing apparatus, method of evaluating degree of association, and program
US10102278B2 (en) * 2010-02-03 2018-10-16 Gartner, Inc. Methods and systems for modifying a user profile for a recommendation algorithm and making recommendations based on user interactions with items
US8688706B2 (en) 2010-12-01 2014-04-01 Google Inc. Topic based user profiles
US10390090B2 (en) * 2010-12-30 2019-08-20 Sony Corporation System and method for social interaction about content items such as movies
US9219945B1 (en) * 2011-06-16 2015-12-22 Amazon Technologies, Inc. Embedding content of personal media in a portion of a frame of streaming media indicated by a frame identifier
US9053750B2 (en) * 2011-06-17 2015-06-09 At&T Intellectual Property I, L.P. Speaker association with a visual representation of spoken content
EP2568395A1 (en) * 2011-09-08 2013-03-13 Axel Springer Digital TV Guide GmbH Method and apparatus for automatic generation of recommendations
JP2013088994A (en) * 2011-10-18 2013-05-13 Sony Corp Information processing apparatus, server, information processing system and information processing method
EP2788946A4 (en) * 2011-12-08 2015-07-29 Intel Corp Personalized passive content delivery
US9495666B2 (en) * 2011-12-15 2016-11-15 Accenture Global Services Limited End-user portal system for remote technical support
CN102708144B (en) * 2012-03-20 2015-05-27 华为技术有限公司 Method and device for information processing
TWI510064B (en) * 2012-03-30 2015-11-21 Inst Information Industry Video recommendation system and method thereof
US9110955B1 (en) * 2012-06-08 2015-08-18 Spotify Ab Systems and methods of selecting content items using latent vectors
US20130346875A1 (en) * 2012-06-20 2013-12-26 Microsoft Corporation Personalized Interactive Entertainment Profile
US9177031B2 (en) 2012-08-07 2015-11-03 Groupon, Inc. Method, apparatus, and computer program product for ranking content channels
US9607025B2 (en) 2012-09-24 2017-03-28 Andrew L. DiRienzo Multi-component profiling systems and methods
US8881209B2 (en) 2012-10-26 2014-11-04 Mobitv, Inc. Feedback loop content recommendation
US9721263B2 (en) * 2012-10-26 2017-08-01 Nbcuniversal Media, Llc Continuously evolving symmetrical object profiles for online advertisement targeting
US9773229B2 (en) * 2012-11-01 2017-09-26 Google Inc. Systems and methods for providing contact group member suggestions
US9129227B1 (en) * 2012-12-31 2015-09-08 Google Inc. Methods, systems, and media for recommending content items based on topics
US9412092B2 (en) * 2013-01-14 2016-08-09 Google Inc. Generating a filtered view of a content stream
US20140236731A1 (en) * 2013-02-21 2014-08-21 Adobe Systems Incorporated Using Interaction Data of Application Users to Target a Social-Networking Advertisement
US9455945B2 (en) * 2013-02-22 2016-09-27 Facebook, Inc. Aggregating likes to a main page
US10600011B2 (en) 2013-03-05 2020-03-24 Gartner, Inc. Methods and systems for improving engagement with a recommendation engine that recommends items, peers, and services
US9400840B2 (en) * 2013-03-25 2016-07-26 Salesforce.Com, Inc. Combining topic suggestions from different topic sources to assign to textual data items
US10068614B2 (en) * 2013-04-26 2018-09-04 Microsoft Technology Licensing, Llc Video service with automated video timeline curation
CN103501465A (en) * 2013-09-06 2014-01-08 上海骋娱传媒技术有限公司 Method and equipment used for video resource access control
US9177262B2 (en) * 2013-12-02 2015-11-03 Qbase, LLC Method of automated discovery of new topics
US9201744B2 (en) 2013-12-02 2015-12-01 Qbase, LLC Fault tolerant architecture for distributed computing systems
US9025892B1 (en) 2013-12-02 2015-05-05 Qbase, LLC Data record compression with progressive and/or selective decomposition
US9542477B2 (en) * 2013-12-02 2017-01-10 Qbase, LLC Method of automated discovery of topics relatedness
US9424524B2 (en) 2013-12-02 2016-08-23 Qbase, LLC Extracting facts from unstructured text
US9659108B2 (en) 2013-12-02 2017-05-23 Qbase, LLC Pluggable architecture for embedding analytics in clustered in-memory databases
US9424294B2 (en) 2013-12-02 2016-08-23 Qbase, LLC Method for facet searching and search suggestions
US9355152B2 (en) 2013-12-02 2016-05-31 Qbase, LLC Non-exclusionary search within in-memory databases
US9619470B2 (en) * 2014-02-04 2017-04-11 Google Inc. Adaptive music and video recommendations
US20150293995A1 (en) 2014-04-14 2015-10-15 David Mo Chen Systems and Methods for Performing Multi-Modal Video Search
US10332185B2 (en) 2014-05-22 2019-06-25 Google Llc Using status of sign-on to online services for content item recommendations
US10121187B1 (en) * 2014-06-12 2018-11-06 Amazon Technologies, Inc. Generate a video of an item
KR102253074B1 (en) 2014-06-13 2021-05-18 플립보드, 인크. Presenting advertisements in a digital magazine by clustering content
US10592539B1 (en) 2014-07-11 2020-03-17 Twitter, Inc. Trends in a messaging platform
US10601749B1 (en) * 2014-07-11 2020-03-24 Twitter, Inc. Trends in a messaging platform
US10162882B2 (en) 2014-07-14 2018-12-25 Nternational Business Machines Corporation Automatically linking text to concepts in a knowledge base
US10503761B2 (en) 2014-07-14 2019-12-10 International Business Machines Corporation System for searching, recommending, and exploring documents through conceptual associations
US10437869B2 (en) 2014-07-14 2019-10-08 International Business Machines Corporation Automatic new concept definition
US9576023B2 (en) * 2014-07-14 2017-02-21 International Business Machines Corporation User interface for summarizing the relevance of a document to a query
CN105450497A (en) * 2014-07-31 2016-03-30 国际商业机器公司 Method and device for generating clustering model and carrying out clustering based on clustering model
US10225591B2 (en) * 2014-10-21 2019-03-05 Comcast Cable Communications, Llc Systems and methods for creating and managing user profiles
US10025863B2 (en) 2014-10-31 2018-07-17 Oath Inc. Recommending contents using a base profile
US9846746B2 (en) * 2014-11-20 2017-12-19 Facebook, Inc. Querying groups of users based on user attributes for social analytics
US10068023B2 (en) 2014-12-30 2018-09-04 Rovi Guides, Inc. Systems and methods for updating links between keywords associated with a trending topic
WO2016137974A1 (en) 2015-02-23 2016-09-01 Wabash National, L.P. Composite refrigerated truck body and method of making the same
US10223442B2 (en) 2015-04-09 2019-03-05 Qualtrics, Llc Prioritizing survey text responses
US10657186B2 (en) * 2015-05-29 2020-05-19 Dell Products, L.P. System and method for automatic document classification and grouping based on document topic
RU2632131C2 (en) 2015-08-28 2017-10-02 Общество С Ограниченной Ответственностью "Яндекс" Method and device for creating recommended list of content
RU2629638C2 (en) 2015-09-28 2017-08-30 Общество С Ограниченной Ответственностью "Яндекс" Method and server of creating recommended set of elements for user
RU2632100C2 (en) 2015-09-28 2017-10-02 Общество С Ограниченной Ответственностью "Яндекс" Method and server of recommended set of elements creation
MX2016013715A (en) 2015-10-23 2017-12-20 Wabash National Lp Extruded molds and methods for manufacturing composite truck panels.
US10339160B2 (en) * 2015-10-29 2019-07-02 Qualtrics, Llc Organizing survey text responses
US10445650B2 (en) 2015-11-23 2019-10-15 Microsoft Technology Licensing, Llc Training and operating multi-layer computational models
US9560152B1 (en) * 2016-01-27 2017-01-31 International Business Machines Corporation Personalized summary of online communications
US10329763B2 (en) 2016-02-24 2019-06-25 Wabash National, L.P. Composite floor structure and method of making the same
US10479419B2 (en) 2016-02-24 2019-11-19 Wabash National, L.P. Composite refrigerated semi-trailer and method of making the same
US10239566B2 (en) 2016-02-24 2019-03-26 Wabash National, L.P. Composite floor for a dry truck body
US10848808B2 (en) * 2016-03-08 2020-11-24 Eagle Eye Networks, Inc. Apparatus for sharing private video streams with public service agencies
US10674116B2 (en) * 2016-03-08 2020-06-02 Eagle Eye Networks, Inc System and apparatus for sharing private video streams with first responders
US11381605B2 (en) * 2016-03-08 2022-07-05 Eagle Eye Networks, Inc. System, methods, and apparatus for sharing private video stream assets with first responders
US10505923B2 (en) * 2016-03-08 2019-12-10 Dean Drako Apparatus for sharing private video streams with first responders and method of operation
US10939141B2 (en) * 2016-03-08 2021-03-02 Eagle Eye Networks, Inc. Apparatus for sharing private video streams with first responders and mobile method of operation
US10733221B2 (en) * 2016-03-30 2020-08-04 Microsoft Technology Licensing, Llc Scalable mining of trending insights from text
US9992539B2 (en) 2016-04-05 2018-06-05 Google Llc Identifying viewing characteristics of an audience of a content channel
RU2632144C1 (en) 2016-05-12 2017-10-02 Общество С Ограниченной Ответственностью "Яндекс" Computer method for creating content recommendation interface
RU2632132C1 (en) 2016-07-07 2017-10-02 Общество С Ограниченной Ответственностью "Яндекс" Method and device for creating contents recommendations in recommendations system
RU2636702C1 (en) 2016-07-07 2017-11-27 Общество С Ограниченной Ответственностью "Яндекс" Method and device for selecting network resource as source of content in recommendations system
US11645317B2 (en) 2016-07-26 2023-05-09 Qualtrics, Llc Recommending topic clusters for unstructured text documents
EP3291110A1 (en) 2016-09-02 2018-03-07 OpenTV, Inc. Content recommendations using personas
US11048744B1 (en) * 2016-12-29 2021-06-29 Shutterstock, Inc. Computer architecture for weighting search results by stylistic preferences
USD882600S1 (en) 2017-01-13 2020-04-28 Yandex Europe Ag Display screen with graphical user interface
US11488028B2 (en) * 2017-03-31 2022-11-01 Yahoo Assets Llc Collaborative personalization via simultaneous embedding of users and their preferences
CN108268581A (en) * 2017-07-14 2018-07-10 广东神马搜索科技有限公司 The construction method and device of knowledge mapping
CN107391760B (en) * 2017-08-25 2018-05-25 平安科技(深圳)有限公司 User interest recognition methods, device and computer readable storage medium
CN107944033B (en) * 2017-12-13 2022-02-18 北京百度网讯科技有限公司 Associated topic recommendation method and device
US10838954B1 (en) 2017-12-14 2020-11-17 Amazon Technologies, Inc. Identifying user content
US11243669B2 (en) * 2018-02-27 2022-02-08 Verizon Media Inc. Transmitting response content items
US11539992B2 (en) * 2018-02-28 2022-12-27 Google Llc Auto-adjust playback speed and contextual information
US10606446B2 (en) 2018-05-04 2020-03-31 David Arthur Yost Computer system with a plurality of work environments where each work environment affords one or more workspaces
US10904590B2 (en) * 2018-05-23 2021-01-26 Otter Network, LLC Method and system for real time switching of multimedia content
US10990421B2 (en) 2018-06-27 2021-04-27 Microsoft Technology Licensing, Llc AI-driven human-computer interface for associating low-level content with high-level activities using topics as an abstraction
US11354581B2 (en) 2018-06-27 2022-06-07 Microsoft Technology Licensing, Llc AI-driven human-computer interface for presenting activity-specific views of activity-specific content for multiple activities
US11449764B2 (en) 2018-06-27 2022-09-20 Microsoft Technology Licensing, Llc AI-synthesized application for presenting activity-specific UI of activity-specific content
RU2720952C2 (en) 2018-09-14 2020-05-15 Общество С Ограниченной Ответственностью "Яндекс" Method and system for generating digital content recommendation
RU2720899C2 (en) 2018-09-14 2020-05-14 Общество С Ограниченной Ответственностью "Яндекс" Method and system for determining user-specific content proportions for recommendation
RU2714594C1 (en) 2018-09-14 2020-02-18 Общество С Ограниченной Ответственностью "Яндекс" Method and system for determining parameter relevance for content items
US11049153B2 (en) * 2018-09-24 2021-06-29 Salesforce.Com, Inc. Mapping and filtering recommendation engine
US11580179B2 (en) * 2018-09-24 2023-02-14 Salesforce.Com, Inc. Method and system for service agent assistance of article recommendations to a customer in an app session
RU2725659C2 (en) 2018-10-08 2020-07-03 Общество С Ограниченной Ответственностью "Яндекс" Method and system for evaluating data on user-element interactions
RU2731335C2 (en) 2018-10-09 2020-09-01 Общество С Ограниченной Ответственностью "Яндекс" Method and system for generating recommendations of digital content
RU2757406C1 (en) 2019-09-09 2021-10-15 Общество С Ограниченной Ответственностью «Яндекс» Method and system for providing a level of service when advertising content element
US10922374B1 (en) 2019-10-24 2021-02-16 Capital One Services, Llc Techniques to determine relationships of items in web-based content
US11397754B2 (en) * 2020-02-14 2022-07-26 International Business Machines Corporation Context-based keyword grouping
CA3225789A1 (en) * 2021-08-11 2023-02-16 Google Llc User interfaces for surfacing web browser history data
US11914632B2 (en) 2022-04-18 2024-02-27 International Business Machines Corporation Intelligent media data explorer

Family Cites Families (105)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6460036B1 (en) 1994-11-29 2002-10-01 Pinpoint Incorporated System and method for providing customized electronic newspapers and target advertisements
US5758257A (en) 1994-11-29 1998-05-26 Herz; Frederick System and method for scheduling broadcast of and access to video programs and other data using customer profiles
US6078866A (en) 1998-09-14 2000-06-20 Searchup, Inc. Internet site searching and listing service based on monetary ranking of site listings
US6115709A (en) * 1998-09-18 2000-09-05 Tacit Knowledge Systems, Inc. Method and system for constructing a knowledge profile of a user having unrestricted and restricted access portions according to respective levels of confidence of content of the portions
US6513039B1 (en) * 1999-06-24 2003-01-28 International Business Machines Corporation Profile inferencing through automated access control list analysis heuristics
US6442606B1 (en) 1999-08-12 2002-08-27 Inktomi Corporation Method and apparatus for identifying spoof documents
US6519648B1 (en) * 2000-01-24 2003-02-11 Friskit, Inc. Streaming media search and continuous playback of multiple media resources located on a network
US6934964B1 (en) * 2000-02-08 2005-08-23 Koninklijke Philips Electronics N.V. Electronic program guide viewing history generator method and system
DE10009297A1 (en) * 2000-02-29 2001-10-04 Siemens Ag Dynamic help system for data processor, especially for Internet or desktop use, generates user help profile logical record depending on frequencies and/or types of access
SG97922A1 (en) 2000-08-21 2003-08-20 Kent Ridge Digital Labs Knowledge discovery system
US7493372B2 (en) 2000-11-20 2009-02-17 British Telecommunications Public Limited Company Method of updating interests
US20030154180A1 (en) 2002-02-13 2003-08-14 Case Simon J. Profile management system
US20040098386A1 (en) 2001-03-30 2004-05-20 Marcus Thint Profile management system
US8260656B1 (en) 2001-04-19 2012-09-04 Amazon.Com, Inc. Mining of user-generated playlists for data regarding relationships between digital works
US7313621B2 (en) 2001-05-15 2007-12-25 Sony Corporation Personalized interface with adaptive content presentation
US7016939B1 (en) 2001-07-26 2006-03-21 Mcafee, Inc. Intelligent SPAM detection system using statistical analysis
JP4250938B2 (en) * 2001-10-15 2009-04-08 パナソニック株式会社 Communication support method and communication server
US20030093794A1 (en) 2001-11-13 2003-05-15 Koninklijke Philips Electronics N.V. Method and system for personal information retrieval, update and presentation
US7813954B1 (en) * 2001-12-14 2010-10-12 Keen Personal Media, Inc. Audiovisual system and method for displaying segmented advertisements tailored to the characteristic viewing preferences of a user
US20030170006A1 (en) * 2002-03-08 2003-09-11 Bogda Peter B. Versatile video player
EP1395056A1 (en) * 2002-08-30 2004-03-03 Sony International (Europe) GmbH Methods to create a user profile and to specify a suggestion for a next selection of the user
US7668885B2 (en) * 2002-09-25 2010-02-23 MindAgent, LLC System for timely delivery of personalized aggregations of, including currently-generated, knowledge
US20070028266A1 (en) 2002-12-04 2007-02-01 Koninklijke Philips Electronics, N.V. Groenewoudseweg 1 Recommendation of video content based on the user profile of users with similar viewing habits
US7124149B2 (en) 2002-12-13 2006-10-17 International Business Machines Corporation Method and apparatus for content representation and retrieval in concept model space
US7725544B2 (en) 2003-01-24 2010-05-25 Aol Inc. Group based spam classification
US20050060643A1 (en) 2003-08-25 2005-03-17 Miavia, Inc. Document similarity detection and classification system
US8321278B2 (en) * 2003-09-30 2012-11-27 Google Inc. Targeted advertisements based on user profiles and page profile
US20050071479A1 (en) 2003-09-30 2005-03-31 Dimitris Achlioptas Smart button
US7346839B2 (en) 2003-09-30 2008-03-18 Google Inc. Information retrieval based on historical data
US7359941B2 (en) 2004-01-08 2008-04-15 International Business Machines Corporation Method and apparatus for filtering spam email
US7392262B1 (en) 2004-02-11 2008-06-24 Aol Llc Reliability of duplicate document detection algorithms
US7624274B1 (en) 2004-02-11 2009-11-24 AOL LLC, a Delaware Limited Company Decreasing the fragility of duplicate document detecting algorithms
US7644127B2 (en) 2004-03-09 2010-01-05 Gozoom.Com, Inc. Email analysis using fuzzy matching of text
US7716223B2 (en) * 2004-03-29 2010-05-11 Google Inc. Variable personalization of search results in a search engine
US8620915B1 (en) 2007-03-13 2013-12-31 Google Inc. Systems and methods for promoting personalized search results based on personal information
US7580921B2 (en) 2004-07-26 2009-08-25 Google Inc. Phrase identification in an information retrieval system
US7707167B2 (en) * 2004-09-20 2010-04-27 Microsoft Corporation Method, system, and apparatus for creating a knowledge interchange profile
US20060074980A1 (en) 2004-09-29 2006-04-06 Sarkar Pte. Ltd. System for semantically disambiguating text information
US20060168032A1 (en) 2004-12-21 2006-07-27 Lucent Technologies, Inc. Unwanted message (spam) detection based on message content
US20060161553A1 (en) 2005-01-19 2006-07-20 Tiny Engine, Inc. Systems and methods for providing user interaction based profiles
US7962510B2 (en) 2005-02-11 2011-06-14 Microsoft Corporation Using content analysis to detect spam web pages
US8306975B1 (en) 2005-03-08 2012-11-06 Worldwide Creative Techniques, Inc. Expanded interest recommendation engine and variable personalization
US7483903B2 (en) 2005-08-17 2009-01-27 Yahoo! Inc. Unsupervised learning tool for feature correction
US7877387B2 (en) 2005-09-30 2011-01-25 Strands, Inc. Systems and methods for promotional media item selection and promotional program unit generation
US8161044B2 (en) 2005-10-26 2012-04-17 International Business Machines Corporation Faceted web searches of user preferred categories throughout one or more taxonomies
US8015065B2 (en) 2005-10-28 2011-09-06 Yahoo! Inc. Systems and methods for assigning monetary values to search terms
US7584183B2 (en) 2006-02-01 2009-09-01 Yahoo! Inc. Method for node classification and scoring by combining parallel iterative scoring calculation
US7739226B2 (en) 2006-02-09 2010-06-15 Ebay Inc. Method and system to analyze aspect rules based on domain coverage of the aspect rules
US8019777B2 (en) 2006-03-16 2011-09-13 Nexify, Inc. Digital content personalization method and system
US7822745B2 (en) * 2006-05-31 2010-10-26 Yahoo! Inc. Keyword set and target audience profile generalization techniques
US20070294721A1 (en) * 2006-06-20 2007-12-20 Sbc Knowledge Ventures, Lp System and method of providing supplemental video content related to targeted advertisements in a video stream
US7685199B2 (en) * 2006-07-31 2010-03-23 Microsoft Corporation Presenting information related to topics extracted from event classes
US20080031447A1 (en) 2006-08-04 2008-02-07 Frank Geshwind Systems and methods for aggregation of access to network products and services
ITMI20061897A1 (en) 2006-10-03 2008-04-04 Pointer S R L SYSTEMS AND METHODS FOR CLASSIFYING RESULTS OF SEARCH ENGINES
US20080091708A1 (en) 2006-10-16 2008-04-17 Idalis Software, Inc. Enhanced Detection of Search Engine Spam
US7680786B2 (en) 2006-10-30 2010-03-16 Yahoo! Inc. Optimization of targeted advertisements based on user profile information
WO2008053426A1 (en) 2006-10-31 2008-05-08 International Business Machines Corporation Identifying unwanted (spam) sms messages
US20080115173A1 (en) 2006-11-10 2008-05-15 Guideworks Llc Systems and methods for using playlists
US7885952B2 (en) 2006-12-20 2011-02-08 Microsoft Corporation Cloaking detection utilizing popularity and market value
US8196166B2 (en) 2006-12-21 2012-06-05 Verizon Patent And Licensing Inc. Content hosting and advertising systems and methods
US8402114B2 (en) 2006-12-28 2013-03-19 Advertising.Com Llc Systems and methods for selecting advertisements for display over a communications network
US7698329B2 (en) 2007-01-10 2010-04-13 Yahoo! Inc. Method for improving quality of search results by avoiding indexing sections of pages
US20080208852A1 (en) 2007-02-26 2008-08-28 Yahoo! Inc. Editable user interests profile
JP4539712B2 (en) * 2007-12-03 2010-09-08 ソニー株式会社 Information processing terminal, information processing method, and program
EP1973045A1 (en) * 2007-03-20 2008-09-24 British Telecommunications Public Limited Company Organising and storing documents
EP1975813A1 (en) * 2007-03-31 2008-10-01 Sony Deutschland Gmbh Method for content recommendation
US20080249987A1 (en) * 2007-04-06 2008-10-09 Gemini Mobile Technologies, Inc. System And Method For Content Selection Based On User Profile Data
US20080263022A1 (en) 2007-04-19 2008-10-23 Blueshift Innovations, Inc. System and method for searching and displaying text-based information contained within documents on a database
US20080294624A1 (en) * 2007-05-25 2008-11-27 Ontogenix, Inc. Recommendation systems and methods using interest correlation
US7734641B2 (en) * 2007-05-25 2010-06-08 Peerset, Inc. Recommendation systems and methods using interest correlation
US7873635B2 (en) 2007-05-31 2011-01-18 Microsoft Corporation Search ranger system and double-funnel model for search spam analyses and browser protection
US20090006368A1 (en) 2007-06-29 2009-01-01 Microsoft Corporation Automatic Video Recommendation
US20090019062A1 (en) 2007-07-09 2009-01-15 Blaksley Ventures 108, Llc System and method for providing universal profiles for networked clusters
US20090144654A1 (en) * 2007-10-03 2009-06-04 Robert Brouwer Methods and apparatus for facilitating content consumption
EP2212771A4 (en) 2007-10-05 2011-06-01 Aharon Mizrahi System and method for enabling search of content
US8136034B2 (en) 2007-12-18 2012-03-13 Aaron Stanton System and method for analyzing and categorizing text
US8037095B2 (en) 2008-02-05 2011-10-11 International Business Machines Corporation Dynamic webcast content viewer method and system
US8370930B2 (en) 2008-02-28 2013-02-05 Microsoft Corporation Detecting spam from metafeatures of an email message
US20090254563A1 (en) 2008-04-02 2009-10-08 Arnold Jeremy A Method and system for dynamically creating and updating user profiles for instant message contacts
US8307395B2 (en) * 2008-04-22 2012-11-06 Porto Technology, Llc Publishing key frames of a video content item being viewed by a first user to one or more second users
US8214346B2 (en) 2008-06-27 2012-07-03 Cbs Interactive Inc. Personalization engine for classifying unstructured documents
US9152722B2 (en) * 2008-07-22 2015-10-06 Yahoo! Inc. Augmenting online content with additional content relevant to user interest
US7870253B2 (en) * 2008-10-01 2011-01-11 The Cobalt Group, Inc. Systems and methods for aggregating user profile information in a network of affiliated websites
US20100153318A1 (en) * 2008-11-19 2010-06-17 Massachusetts Institute Of Technology Methods and systems for automatically summarizing semantic properties from documents with freeform textual annotations
US8443390B2 (en) * 2008-12-05 2013-05-14 Qualcomm Incorporated Enhanced method and apparatus for enhancing support for service delivery
US8112393B2 (en) * 2008-12-05 2012-02-07 Yahoo! Inc. Determining related keywords based on lifestream feeds
US9521013B2 (en) 2008-12-31 2016-12-13 Facebook, Inc. Tracking significant topics of discourse in forums
US8462160B2 (en) 2008-12-31 2013-06-11 Facebook, Inc. Displaying demographic information of members discussing topics in a forum
US9600581B2 (en) * 2009-02-19 2017-03-21 Yahoo! Inc. Personalized recommendations on dynamic content
US20100281025A1 (en) * 2009-05-04 2010-11-04 Motorola, Inc. Method and system for recommendation of content items
US20110029515A1 (en) 2009-07-31 2011-02-03 Scholz Martin B Method and system for providing website content
US8849725B2 (en) 2009-08-10 2014-09-30 Yahoo! Inc. Automatic classification of segmented portions of web pages
US20110041157A1 (en) * 2009-08-13 2011-02-17 Tandberg Television Inc. Systems and Methods for Selecting Content For a Subscriber of a Content Service Provider
JP5609056B2 (en) * 2009-10-14 2014-10-22 ソニー株式会社 Content relationship visualization device, display control device, content relationship visualization method and program
US8572238B2 (en) * 2009-10-22 2013-10-29 Sony Corporation Automated social networking television profile configuration and processing
US20110153638A1 (en) 2009-12-17 2011-06-23 International Business Machines Corporation Continuity and quality of artistic media collections
CA2823791A1 (en) 2010-01-11 2011-07-14 Barjinderpal S. Gill Apparatus and method for delivering target content to members on a social network
US20120066303A1 (en) 2010-03-03 2012-03-15 Waldeck Technology, Llc Synchronized group location updates
WO2011120211A1 (en) * 2010-03-29 2011-10-06 Nokia Corporation Method and apparatus for seeded user interest modeling
US20110295612A1 (en) * 2010-05-28 2011-12-01 Thierry Donneau-Golencer Method and apparatus for user modelization
US20120042262A1 (en) 2010-08-11 2012-02-16 Apple Inc. Population segmentation based on behavioral patterns
US20120102121A1 (en) * 2010-10-25 2012-04-26 Yahoo! Inc. System and method for providing topic cluster based updates
WO2012056463A1 (en) 2010-10-29 2012-05-03 Hewlett-Packard Development Company, L.P. Content recommendation for groups
US8688706B2 (en) 2010-12-01 2014-04-01 Google Inc. Topic based user profiles
US8725795B1 (en) * 2011-06-17 2014-05-13 A9.Com, Inc. Content segment optimization techniques

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of EP2646964A4 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9473745B2 (en) 2014-01-30 2016-10-18 Google Inc. System and method for providing live imagery associated with map locations
US9836826B1 (en) 2014-01-30 2017-12-05 Google Llc System and method for providing live imagery associated with map locations

Also Published As

Publication number Publication date
CN103348342A (en) 2013-10-09
CN103329151A (en) 2013-09-25
US20140372435A1 (en) 2014-12-18
US8849958B2 (en) 2014-09-30
US20120143911A1 (en) 2012-06-07
CN103348342B (en) 2017-03-15
US20120143996A1 (en) 2012-06-07
US8589434B2 (en) 2013-11-19
CN103329151B (en) 2016-09-28
WO2012075341A3 (en) 2013-01-10
EP2646971A4 (en) 2015-06-03
US9355168B1 (en) 2016-05-31
EP2646971A2 (en) 2013-10-09
WO2012075335A3 (en) 2013-03-21
US20120143871A1 (en) 2012-06-07
EP2646964A4 (en) 2015-06-03
WO2012075341A2 (en) 2012-06-07
US9275001B1 (en) 2016-03-01
US20160044131A1 (en) 2016-02-11
EP2646964A2 (en) 2013-10-09
US9317468B2 (en) 2016-04-19
US8688706B2 (en) 2014-04-01

Similar Documents

Publication Publication Date Title
US8589434B2 (en) Recommendations based on topic clusters
US11620326B2 (en) User-specific media playlists
US10387115B2 (en) Method and apparatus for generating a recommended set of items
US20220122099A1 (en) Analytical precursor mining for personalized recommendation
US9055343B1 (en) Recommending content based on probability that a user has interest in viewing the content again
RU2725659C2 (en) Method and system for evaluating data on user-element interactions
US10380649B2 (en) System and method for logistic matrix factorization of implicit feedback data, and application to media environments
US10387513B2 (en) Method and apparatus for generating a recommended content list
US11294974B1 (en) Golden embeddings
US8370286B2 (en) System for personalized term expansion and recommendation
US10642905B2 (en) System and method for ranking search engine results
US20210089573A1 (en) Systems, methods, and computer-readable products for track selection
RU2714594C1 (en) Method and system for determining parameter relevance for content items
EP3136265A1 (en) Method and apparatus for generating a recommended content list
US20220027776A1 (en) Content cold-start machine learning system
US20220027373A1 (en) Intuitive content search results suggestion system
US11711581B2 (en) Multimodal sequential recommendation with window co-attention
US20220374761A1 (en) Systems and methods for rendering near-real-time embedding models for personalized news recommendations
WO2023209691A1 (en) System and method for ranking recommendations in streaming platforms

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 2011845853

Country of ref document: EP

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 11845853

Country of ref document: EP

Kind code of ref document: A2