{"id":148,"date":"2013-05-30T17:18:10","date_gmt":"2013-05-30T15:18:10","guid":{"rendered":"http:\/\/data-science.ch\/wordpress\/?p=148"},"modified":"2014-06-20T14:34:29","modified_gmt":"2014-06-20T12:34:29","slug":"user-centric-learning-to-rank","status":"publish","type":"post","link":"https:\/\/blog.zhaw.ch\/datascience\/user-centric-learning-to-rank\/","title":{"rendered":"User-centric Learning to Rank"},"content":{"rendered":"<p>In a recent CTI project with our industry partner Nektoon AG we were involved in the development of the\u00a0context intelligence application <a title=\"Squirro\" href=\"http:\/\/www.squirro.com\" target=\"_blank\">Squirro<\/a>. In Squirro, users can create topics that consist of various text streams such as RSS feeds, blogs and Facebook accounts (see for example the following marketing video from Nektoon):<\/p>\n<div class=\"lyte-wrapper\" style=\"width:640px;max-width:100%;margin:5px;\"><div class=\"lyMe\" id=\"WYL_2PLQBUNRMM0\"><div id=\"lyte_2PLQBUNRMM0\" data-src=\"https:\/\/blog.zhaw.ch\/datascience\/wp-content\/plugins\/wp-youtube-lyte\/lyteCache.php?origThumbUrl=%2F%2Fi.ytimg.com%2Fvi%2F2PLQBUNRMM0%2Fhqdefault.jpg\" class=\"pL\"><div class=\"tC\"><div class=\"tT\"><\/div><\/div><div class=\"play\"><\/div><div class=\"ctrl\"><div class=\"Lctrl\"><\/div><div class=\"Rctrl\"><\/div><\/div><\/div><noscript><a href=\"https:\/\/youtu.be\/2PLQBUNRMM0\" rel=\"nofollow\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/blog.zhaw.ch\/datascience\/wp-content\/plugins\/wp-youtube-lyte\/lyteCache.php?origThumbUrl=https%3A%2F%2Fi.ytimg.com%2Fvi%2F2PLQBUNRMM0%2F0.jpg\" alt=\"YouTube video thumbnail\" width=\"640\" height=\"340\" \/><br \/>Watch this video on YouTube<\/a><\/noscript><\/div><\/div><div class=\"lL\" style=\"max-width:100%;width:640px;margin:5px;\"><br\/><span class=\"lyte_disclaimer\">Defaulttext aus wp-youtube-lyte.php<\/span><\/div><\/p>\n<p>One particular problem was to design and implement a method to identify text documents in a stream that a user might be interested to read. For example in an RSS feed of a company, a user might only be interested in a specific product of this particular company. Thus, he will generally ignore documents about other topics and would prefer to not seeing these anymore. The chosen approach is to infer the future interests of a user based on his past interactions with documents. From these actions we can determine a set of documents which the user is expected to be interested in and create a profile for each user using state of the art text feature selection methods. This allows us to calculate how well a document matches the usual interest of a user. According to this ranking we sort the documents and thus documents matching the user\u2019s interest profile most closely rise to the top ranks.<\/p>\n<p><!--more--><\/p>\n<p>To validate our method we created a test collection of 20 different topics containing approximately 2700 documents in total. For each stream, the relevance of all contained documents was judged and additionally, appropriate user interactions were performed to model a typical user. Subsequently, the user\u2019s interests were estimated using the method described above and the documents were ranked accordingly.<\/p>\n<p>RSS feed readers and the like normally sort documents by their publishing date. The following plot shows a comparison of the average precision<a href=\"#footnote-1\">[1]<\/a> for different numbers of retrieved documents between our implemented ranking and the simple ranking by date. As expected, a striking difference becomes apparent.<\/p>\n<figure id=\"attachment_699\" aria-describedby=\"caption-attachment-699\" style=\"width: 823px\" class=\"wp-caption alignnone\"><a href=\"http:\/\/blog.zhaw.ch\/datascience\/files\/2013\/05\/user-centricRelevance.png\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-699 size-full\" src=\"http:\/\/blog.zhaw.ch\/datascience\/files\/2013\/05\/user-centricRelevance.png\" alt=\"user-centricRelevance\" width=\"823\" height=\"541\" srcset=\"https:\/\/blog.zhaw.ch\/datascience\/files\/2013\/05\/user-centricRelevance.png 823w, https:\/\/blog.zhaw.ch\/datascience\/files\/2013\/05\/user-centricRelevance-300x197.png 300w, https:\/\/blog.zhaw.ch\/datascience\/files\/2013\/05\/user-centricRelevance-456x300.png 456w\" sizes=\"auto, (max-width: 823px) 100vw, 823px\" \/><\/a><figcaption id=\"caption-attachment-699\" class=\"wp-caption-text\">Average precision: user-centric ranking vs. storted by date<\/figcaption><\/figure>\n<p id=\"footnote-1\">[1] Precision is a metric employed in information retrieval evaluation, which denotes the ratio of the number of relevant documents to the number of documents retrieved in a result set. The plot shows average precision over all 20 test streams (y-axis) at n retrieved documents (x-axis).<\/p>\n<div class=\"pt-sm\">Schlagw\u00f6rter: <a href=\"https:\/\/blog.zhaw.ch\/datascience\/tag\/information-retrieval\/\">Information Retrieval<\/a>, <a href=\"https:\/\/blog.zhaw.ch\/datascience\/tag\/text-analytics\/\">Text Analytics<\/a><br><\/div>","protected":false},"excerpt":{"rendered":"<p>In a recent CTI project with our industry partner Nektoon AG we were involved in the development of the\u00a0context intelligence application Squirro. In Squirro, users can create topics that consist of various text streams such as RSS feeds, blogs and Facebook accounts (see for example the following marketing video from Nektoon): Watch this video on [&hellip;]<\/p>\n","protected":false},"author":44,"featured_media":699,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"ngg_post_thumbnail":0,"footnotes":""},"categories":[7,9],"tags":[8,10],"features":[],"class_list":["post-148","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-blog","category-research","tag-information-retrieval","tag-text-analytics"],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v27.2 (Yoast SEO v27.2) - https:\/\/yoast.com\/product\/yoast-seo-premium-wordpress\/ -->\n<title>User-centric Learning to Rank - Data Science made in Switzerland<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/blog.zhaw.ch\/datascience\/user-centric-learning-to-rank\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"User-centric Learning to Rank\" \/>\n<meta property=\"og:description\" content=\"In a recent CTI project with our industry partner Nektoon AG we were involved in the development of the\u00a0context intelligence application Squirro. In Squirro, users can create topics that consist of various text streams such as RSS feeds, blogs and Facebook accounts (see for example the following marketing video from Nektoon): Watch this video on [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/blog.zhaw.ch\/datascience\/user-centric-learning-to-rank\/\" \/>\n<meta property=\"og:site_name\" content=\"Data Science made in Switzerland\" \/>\n<meta property=\"article:published_time\" content=\"2013-05-30T15:18:10+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2014-06-20T12:34:29+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/blog.zhaw.ch\/datascience\/files\/2013\/05\/user-centricRelevance.png\" \/>\n\t<meta property=\"og:image:width\" content=\"823\" \/>\n\t<meta property=\"og:image:height\" content=\"541\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Melanie Imhof\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Melanie Imhof\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/blog.zhaw.ch\/datascience\/user-centric-learning-to-rank\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/blog.zhaw.ch\/datascience\/user-centric-learning-to-rank\/\"},\"author\":{\"name\":\"Melanie Imhof\",\"@id\":\"https:\/\/blog.zhaw.ch\/datascience\/#\/schema\/person\/53375579a9cf3cc9931d74ba49d1bcd7\"},\"headline\":\"User-centric Learning to Rank\",\"datePublished\":\"2013-05-30T15:18:10+00:00\",\"dateModified\":\"2014-06-20T12:34:29+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/blog.zhaw.ch\/datascience\/user-centric-learning-to-rank\/\"},\"wordCount\":385,\"commentCount\":0,\"image\":{\"@id\":\"https:\/\/blog.zhaw.ch\/datascience\/user-centric-learning-to-rank\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/blog.zhaw.ch\/datascience\/files\/2013\/05\/user-centricRelevance.png\",\"keywords\":[\"Information Retrieval\",\"Text Analytics\"],\"articleSection\":[\"Blog\",\"Research\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/blog.zhaw.ch\/datascience\/user-centric-learning-to-rank\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/blog.zhaw.ch\/datascience\/user-centric-learning-to-rank\/\",\"url\":\"https:\/\/blog.zhaw.ch\/datascience\/user-centric-learning-to-rank\/\",\"name\":\"User-centric Learning to Rank - Data Science made in Switzerland\",\"isPartOf\":{\"@id\":\"https:\/\/blog.zhaw.ch\/datascience\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/blog.zhaw.ch\/datascience\/user-centric-learning-to-rank\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/blog.zhaw.ch\/datascience\/user-centric-learning-to-rank\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/blog.zhaw.ch\/datascience\/files\/2013\/05\/user-centricRelevance.png\",\"datePublished\":\"2013-05-30T15:18:10+00:00\",\"dateModified\":\"2014-06-20T12:34:29+00:00\",\"author\":{\"@id\":\"https:\/\/blog.zhaw.ch\/datascience\/#\/schema\/person\/53375579a9cf3cc9931d74ba49d1bcd7\"},\"breadcrumb\":{\"@id\":\"https:\/\/blog.zhaw.ch\/datascience\/user-centric-learning-to-rank\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/blog.zhaw.ch\/datascience\/user-centric-learning-to-rank\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/blog.zhaw.ch\/datascience\/user-centric-learning-to-rank\/#primaryimage\",\"url\":\"https:\/\/blog.zhaw.ch\/datascience\/files\/2013\/05\/user-centricRelevance.png\",\"contentUrl\":\"https:\/\/blog.zhaw.ch\/datascience\/files\/2013\/05\/user-centricRelevance.png\",\"width\":823,\"height\":541},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/blog.zhaw.ch\/datascience\/user-centric-learning-to-rank\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Startseite\",\"item\":\"https:\/\/blog.zhaw.ch\/datascience\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"User-centric Learning to Rank\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/blog.zhaw.ch\/datascience\/#website\",\"url\":\"https:\/\/blog.zhaw.ch\/datascience\/\",\"name\":\"Data Science made in Switzerland\",\"description\":\"Ein Blog der ZHAW Z\u00fcrcher Hochschule f\u00fcr Angewandte Wissenschaften\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/blog.zhaw.ch\/datascience\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/blog.zhaw.ch\/datascience\/#\/schema\/person\/53375579a9cf3cc9931d74ba49d1bcd7\",\"name\":\"Melanie Imhof\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/secure.gravatar.com\/avatar\/d41ae100b7810c1fed509c4c3395c60d3fad3d5dc755b9141c708e77cb788d09?s=96&d=mm&r=g\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/d41ae100b7810c1fed509c4c3395c60d3fad3d5dc755b9141c708e77cb788d09?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/d41ae100b7810c1fed509c4c3395c60d3fad3d5dc755b9141c708e77cb788d09?s=96&d=mm&r=g\",\"caption\":\"Melanie Imhof\"},\"url\":\"https:\/\/blog.zhaw.ch\/datascience\/author\/imhf\/\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"User-centric Learning to Rank - Data Science made in Switzerland","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/blog.zhaw.ch\/datascience\/user-centric-learning-to-rank\/","og_locale":"en_US","og_type":"article","og_title":"User-centric Learning to Rank","og_description":"In a recent CTI project with our industry partner Nektoon AG we were involved in the development of the\u00a0context intelligence application Squirro. In Squirro, users can create topics that consist of various text streams such as RSS feeds, blogs and Facebook accounts (see for example the following marketing video from Nektoon): Watch this video on [&hellip;]","og_url":"https:\/\/blog.zhaw.ch\/datascience\/user-centric-learning-to-rank\/","og_site_name":"Data Science made in Switzerland","article_published_time":"2013-05-30T15:18:10+00:00","article_modified_time":"2014-06-20T12:34:29+00:00","og_image":[{"width":823,"height":541,"url":"https:\/\/blog.zhaw.ch\/datascience\/files\/2013\/05\/user-centricRelevance.png","type":"image\/png"}],"author":"Melanie Imhof","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Melanie Imhof","Est. reading time":"2 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/blog.zhaw.ch\/datascience\/user-centric-learning-to-rank\/#article","isPartOf":{"@id":"https:\/\/blog.zhaw.ch\/datascience\/user-centric-learning-to-rank\/"},"author":{"name":"Melanie Imhof","@id":"https:\/\/blog.zhaw.ch\/datascience\/#\/schema\/person\/53375579a9cf3cc9931d74ba49d1bcd7"},"headline":"User-centric Learning to Rank","datePublished":"2013-05-30T15:18:10+00:00","dateModified":"2014-06-20T12:34:29+00:00","mainEntityOfPage":{"@id":"https:\/\/blog.zhaw.ch\/datascience\/user-centric-learning-to-rank\/"},"wordCount":385,"commentCount":0,"image":{"@id":"https:\/\/blog.zhaw.ch\/datascience\/user-centric-learning-to-rank\/#primaryimage"},"thumbnailUrl":"https:\/\/blog.zhaw.ch\/datascience\/files\/2013\/05\/user-centricRelevance.png","keywords":["Information Retrieval","Text Analytics"],"articleSection":["Blog","Research"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/blog.zhaw.ch\/datascience\/user-centric-learning-to-rank\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/blog.zhaw.ch\/datascience\/user-centric-learning-to-rank\/","url":"https:\/\/blog.zhaw.ch\/datascience\/user-centric-learning-to-rank\/","name":"User-centric Learning to Rank - Data Science made in Switzerland","isPartOf":{"@id":"https:\/\/blog.zhaw.ch\/datascience\/#website"},"primaryImageOfPage":{"@id":"https:\/\/blog.zhaw.ch\/datascience\/user-centric-learning-to-rank\/#primaryimage"},"image":{"@id":"https:\/\/blog.zhaw.ch\/datascience\/user-centric-learning-to-rank\/#primaryimage"},"thumbnailUrl":"https:\/\/blog.zhaw.ch\/datascience\/files\/2013\/05\/user-centricRelevance.png","datePublished":"2013-05-30T15:18:10+00:00","dateModified":"2014-06-20T12:34:29+00:00","author":{"@id":"https:\/\/blog.zhaw.ch\/datascience\/#\/schema\/person\/53375579a9cf3cc9931d74ba49d1bcd7"},"breadcrumb":{"@id":"https:\/\/blog.zhaw.ch\/datascience\/user-centric-learning-to-rank\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/blog.zhaw.ch\/datascience\/user-centric-learning-to-rank\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/blog.zhaw.ch\/datascience\/user-centric-learning-to-rank\/#primaryimage","url":"https:\/\/blog.zhaw.ch\/datascience\/files\/2013\/05\/user-centricRelevance.png","contentUrl":"https:\/\/blog.zhaw.ch\/datascience\/files\/2013\/05\/user-centricRelevance.png","width":823,"height":541},{"@type":"BreadcrumbList","@id":"https:\/\/blog.zhaw.ch\/datascience\/user-centric-learning-to-rank\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Startseite","item":"https:\/\/blog.zhaw.ch\/datascience\/"},{"@type":"ListItem","position":2,"name":"User-centric Learning to Rank"}]},{"@type":"WebSite","@id":"https:\/\/blog.zhaw.ch\/datascience\/#website","url":"https:\/\/blog.zhaw.ch\/datascience\/","name":"Data Science made in Switzerland","description":"Ein Blog der ZHAW Z\u00fcrcher Hochschule f\u00fcr Angewandte Wissenschaften","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/blog.zhaw.ch\/datascience\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/blog.zhaw.ch\/datascience\/#\/schema\/person\/53375579a9cf3cc9931d74ba49d1bcd7","name":"Melanie Imhof","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/d41ae100b7810c1fed509c4c3395c60d3fad3d5dc755b9141c708e77cb788d09?s=96&d=mm&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/d41ae100b7810c1fed509c4c3395c60d3fad3d5dc755b9141c708e77cb788d09?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/d41ae100b7810c1fed509c4c3395c60d3fad3d5dc755b9141c708e77cb788d09?s=96&d=mm&r=g","caption":"Melanie Imhof"},"url":"https:\/\/blog.zhaw.ch\/datascience\/author\/imhf\/"}]}},"_links":{"self":[{"href":"https:\/\/blog.zhaw.ch\/datascience\/wp-json\/wp\/v2\/posts\/148","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blog.zhaw.ch\/datascience\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.zhaw.ch\/datascience\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.zhaw.ch\/datascience\/wp-json\/wp\/v2\/users\/44"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.zhaw.ch\/datascience\/wp-json\/wp\/v2\/comments?post=148"}],"version-history":[{"count":5,"href":"https:\/\/blog.zhaw.ch\/datascience\/wp-json\/wp\/v2\/posts\/148\/revisions"}],"predecessor-version":[{"id":716,"href":"https:\/\/blog.zhaw.ch\/datascience\/wp-json\/wp\/v2\/posts\/148\/revisions\/716"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/blog.zhaw.ch\/datascience\/wp-json\/wp\/v2\/media\/699"}],"wp:attachment":[{"href":"https:\/\/blog.zhaw.ch\/datascience\/wp-json\/wp\/v2\/media?parent=148"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.zhaw.ch\/datascience\/wp-json\/wp\/v2\/categories?post=148"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.zhaw.ch\/datascience\/wp-json\/wp\/v2\/tags?post=148"},{"taxonomy":"features","embeddable":true,"href":"https:\/\/blog.zhaw.ch\/datascience\/wp-json\/wp\/v2\/features?post=148"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}