ISSN 1566-6379

First published
in 2003


   

Paper 20 - Issue 2

Home Papers in this Issue Previous Issues Site Map

    .

Home
About the Journal
Scope
Editorial Board
Submission Guidelines
Call for Papers
Join the Committee

ECITE: European Conference on Information Technology Evaluation

Impact of Content-based Personalization in a News Environment. A Case Study. Erik Wallin, Royal Institute of Technology (KTH), Stockholm, Sweden. erikw@kth.se
   
1.         Introduction

This paper describes the implementation of a content-adapting personalization system at the website of the second largest Swedish daily newspaper, the Svenska Dagbladet (SvD, 2001). The system utilizes content descriptions provided by the journalists to personalize information to the readers. The journalists categorize the material according to a modified news categorization standard and the readers are exposed to a special selection that is personalized on an individual basis. The reader’s behavior is recorded and used to optimize the business objectives.

With an increasing information supply, the use of personalization promises timesaving and better matching of a given service or product to preference (Peppers et al., 1999). The term personalization is used to describe the computer-supported process of adapting an information flow and its presentation in real time to each individual user’s behavior (Wallin et al., 1998).

Businesses recognize an opportunity to increase user satisfaction by learning from the user’s behavior and thus being able to pinpoint individual preferences and adapt the communication accordingly to increase business efficiency. To increase revenues the personalization system needs to influence user behavior so as to increase user activity (i.e. frequency) on the website. The primary source of revenues for the news services industry is advertising; the larger the audience (traffic volume), the higher the revenues. With the increasing cost of investment and operation of information technology systems, the need for cost-effective solutions increases and personalization systems must maintain cost efficient operation. (Peppers et al., 1999).

Examples of personalization of news services on the Internet reach back to early experiments in the beginning of the 1990s (Chesnais et al., 1995). The NY Times’ Internet edition has successfully been using methods to adapt information to different segments (Saunders, 2002). A popular method of adapting information is called content filtering (Foltz et al., 1992) and refers to the concept of selecting content for a user based on the user’s characteristics and a description of the content (Lassila, 1997). Turpeinen et al. (Turpeinen et al., 1996) describe a system for a personalized news service based on semantic descriptions of the news items and corresponding user profiles. Turpeinen concludes that personalization is important for news producers because it adds value to their users. The SmartPush project (Jokela et al., 2001) found an operational method of producing structured semantic metadata for news services in a manner that worked well with personalization.

2.         Objective

The primary objective of this paper is to evaluate if personalization provides a positive effect on measurable business objectives. The secondary objective is to find the parameters and settings that offer the best effects on business.

3.         Operational functionality of the personalization system

3.1       The basics of the personalization system

Personalization is based on the behavior of the user. Behavioral data is collected when a user interacts with different information objects in the system (e.g. an editorial article). Multiple interactions are gathered and saved as the user’s individual profile. When the system performs a personalization, the user’s profile is compared to profiles of the information objects and they are sorted accordingly. (Wallin, 2000).

3.2       System overview

All notations in brackets () refer to Figure 1. The personalization system (E) is a stand-alone system that is connected to the web system of SvD (A).

3.2.1   System implementation at the SvD (A) consists of:

(B)       Presentation layer. At the SvD this is the web page that the user sees.

(C)       Application layer. At the SvD this is a COM component together with an ASP-script (Winnick Cluts, 1997)

(D)       Data layer. At the SvD this is an SQL database.

(E)       Personalization system.

3.2.2   User identification

The personalization system receives the user identification with every interaction between the personalization system and the SvD website. The personalization system uses SvD’s user identification mechanism which consists of an anonymous unique identifier in the form of a cookie (Kristol et al., 1997) that is placed on a user’s computer the first time the user visits the SvD website.

3.2.3   Profiles and metadata

The personalization system uses two kinds of metadata profiles (Lennstrand et al., 1999) for the personalization process.

§          Article profile – describes the editorial articles.

§          User profile – describes the individual user’s characteristics based on historical behavior.

The descriptions are stored as vectors (Wallin et al., 2000) representing different aspects of the article (Wallin et al., 1998).

3.3       Processes

All notations in brackets () refer to Figure 1.

3.3.1   Personalization of the website:

(1)       A user enters the SvD website and requests a personalized object (e.g. editorial articles).

(2)       The application layer (C) requests the desired articles from the data layer (D).

(3)       The requested articles are returned to the application layer (C)

(4)       The application layer sends the articles to the personalization system (E).

(5)       The articles are sorted (Wallin et al., 2000) according to the user’s profile.

(6)       The personalized (sorted) articles are returned to the application layer (C).

(7)       The application layer (C) delivers the personalized articles to the presentation layer (B). The user receives a personalized web page.

Figure 1: System overview

3.3.2   Recording the user’s behavior:

I.          A user enters the SvD website and requests an object (e.g. some editorial articles) that the personalization system has knowledge of.

II.          The application layer (C) sends a notification to the personalization system (E) that a specific user has interacted with a specific article.

III.         The personalization system (E) records the event (Wallin et al., 2000)

3.3.3   Personalization (sorting)

The process of sorting (5) the articles according to the users profile is done with an exchangeable algorithm (Wallin et al., 2000). The algorithm used at SvD is of a shortest distance type (Russel et al., 1995).

4.         Implementation at the SvD

During one week (11/19/2001 to 11/25/2001) a test of the personalization system was performed on the SvD website. The personalization system had been implemented in August 2001.

4.1       Business objectives

SvD’s primary business objective for the personalization system was to increase the average number of page impressions a user demanded per session. Page impressions are defined as the total number of downloads of web pages from a website (KIA, 2003).

4.2       Personalized functions

The selection of the different functions and areas was made in cooperation with SvD. One of SvD’s requests was that one area should be personalized on the first web page to address the problem that the majority of the visitors left the site after only viewing the first page.

A second area was at the end of an individual editorial article. The logic behind this function was the assumption that when a user had finished reading an article, a personalized selection of articles would increase the chance that the user would choose to read more.

4.2.1   The first page

In the upper-right hand corner of the first web page a small area was used to display a section name as header in a large font and the heading of the latest editorial article in that section in a smaller font. The smaller text was a hyperlink to the selected article. The articles used were limited to the three most recently published articles from the different sections. (Figure 2).

 

Figure 2.

4.2.2   Article content pages

At the bottom of all article content pages a list of five other article headers was displayed as hyperlinks to the respective articles. (Figure 3).

 

Figure 3.

4.3       Metadata structure

The personalization operation is based on two metadata descriptions of the news material, (1) Category (e.g. sports, entertainment, international news etc.) and (2) Geographic location (e.g. Stockholm, Sweden, Europe, etc.).

For the category description SvD is using a TT NITF metadata structure (Stadler et al., 2003). The TT NITF metadata structure originates from the International Press Telecommunications Council’s (ITPC) commonly used standard for categorization of news material called NITF (News Industry Text Format) (Karben, 2003). SvD uses the top-level of the TT NITF and has made minor modifications to fit their internal information structure. The geographic location description was retrieved from SvD’s internal information structure.

A journalist who has written an article also encodes it with metadata. Some articles from other sources with compatible TT NITF metadata encoding are imported to SvD.

5.         Method

5.1       Delimitation

The results regarding the benefits of personalization are delimited to quantitative measures without any qualitative input from the end-users.

5.2       Data collection

To measure the effect of the personalization system with regards to the business objectives, a method called bifurcation (Lennstrand et al., 1999) is used to stochastically divide the users into two or more different groups and then compare the outcome between the groups and analyze the results. (Wallin et al., 2000)

The results regarding the performance of the personalization system were gathered in an SQL-database in real-time and presented via a report subsystem similar to OLAP (On-Line Analytical Processing).

5.3       Data selection

During one week (11/19/2001 to 11/25/2001) a test of the personalization system was performed on the SvD website (SvD, 2001). The personalization system had been implemented in August 2001. This period was chosen because it was the only one with minor operational interruptions. The interruptions originated from difficulties with the integration of a new system into a complex and heavily burdened system environment. The user profiles originated from the beginning of the implementation and were not erased at the start of the test.

The test was limited to three groups with active personalization and one group with no personalization (the reference group). The users in the reference group received articles selected alphabetically rather than based on individual personalization. A total of only four groups was chosen to reduce the analytical complexity of the results. The personalization system stochastically assigned every single user with one of four different predefined personalization configurations that the user kept throughout the test period.

Table 1: Personalization Configurations used.

Table 1: Traffic volume represents the distribution of the users to the different configurations.

Personalization group Traffic volume Algorithm @ weight End update weight
1_33 25% AL 2.0 @ 33% 50%
2_67 25% AL 2.0 @ 67% 50%
3_100 25% AL 2.0 @ 100% 50%
Reference 25% Null 50%

The weights in the algorithm field represent the weight with which the profile representing the user’s interactions during the active session is merged with the profile representing the user’s historical behavior, i.e. the volatility of the system. The same matching algorithm (AL 2.0) was used in all configurations.

The End update weight represents the weight with which the user’s historical profile is updated at the end of the session to reflect the session behavior, i.e. the articles the user has interacted with during the session. The End update weight parameter was not the subject of the test and therefore heuristically chosen.

The implication of the different algorithm weights is that a lower number (e.g. 33%) means that the personalization system takes greater consideration of the historical behavior. A setting of 100% provides an entirely session-dependent system, or in other words an extremely volatile system.

During the test, all interactions performed by the users were logged, including session length and requests for articles

5.4       Validation

The integrity of the built-in report system of the personalization system was verified by random manual comparison calculations. In this test no data has been subtracted or corrected.

6.         Results from SvD

Sessions = Total number of times web pages have been downloaded from a website by a web browser on a coherent occasion (KIA, 2003).

Interaction = A user clicks on an article known by the personalization system.

6.1       Key figures

The test was performed during November 19–25, 2001.

There was a total of 81,305 sessions with interactions.

There were 239,676 sessions in total according to RedSheriff (RedSheriff, 2003), the official measurement system of web traffic at SvD. Thus, 34% (= 81,305/239,676) of the sessions at SvD had interactions.

6.2       Result per personalization group vs. reference

Three different personalization groups were compared with each other and a non-personalized reference group with regards to average interactions per session and with a 24-hour sample size (Figure 4).

The 33% session weight personalization group is between -4.6 % and 11.6% better than the non-personalized group with regards to average interactions per session. The average for the week is 1.0% better than the non-personalized group.

The 67% session weight personalization group is between 1.3 % and 29.5% better than the non-personalized group with regards to average interactions per session. The average for the week is 10.0% better than the non-personalized group.

The 100% session weight personalization group is between -4.3 % and 19.3% better than the non-personalized group with regards to average interactions per session. The average for the week is 8.0% better than the non-personalized group.

 

Figure 4: The average number of interactions per session and day for each personalization group.

6.3       Statistical significance

In order to test the statistical significance of personalization group differences, a one-way analysis of variance (ANOVA) was carried out with the number of interactions per session as the dependent variable and the personalization group as the group factor (table 2). Prior to carrying out the ANOVA test, a contrast test was specified between the reference group and all other groups combined. The ANOVA indicated that the group difference was statistically significant (F[3,81301]=10.24; p<.001). The contrast between the reference group and all other groups was also statistically significant (t= ﷓4.21; DF=51055; p<.001 assuming unequal variances).

Table 2: Means and standard deviations of interactions per session by groups.

Personalization group Traffic volume

(sessions)

Interactions
Mean
Standard deviation
1_33 20,069 1.94 1.93
2_67 20,339 2.12 5.21
3_100 20,073 2.09 6.15
Reference 20,824 1.93 3.18
Total 81,305 2.00 2.17

The number of interactions per session was skewly distributed in all groups, and the assumption of equality of variances in the ANOVA analysis was violated (Levene's test of equality of variances[3,81301]=18.16; p<.001). However, a nonparametric test of group differences, Kruskal-Wallis' test also showed a significant group difference (c2[3]=30.0; p<.001).

The group differences were further explored in the number of interactions per session using a categorization of the number of interactions per session variable into six categories:

One interaction

Two interactions

Three interactions

4–5 interactions

6–10 interactions

11 or more interactions (385)

Table 3 shows a breakdown of the number of sessions reduced to six categories, on personalization groups. The overall effect of personalization on the number of interactions per session was significant also when categorized in this manner (c2[15]=59.6; p<.001).

Table 3: Cross-tabulation of number of sessions in six categories by personalization group.

Number of
interactions
  33% 67% 100% Reference Total
1. 1 Number of sessions 11,440 11,428 11,493 12,126 46,487
  % within group 57.0% 56.2% 57.3% 58.2% 57.2%
2. 2 Number of sessions 4,275 4,284 4,278 4,466 17,303
  % within group 21.3% 21.1% 21.3% 21.4% 21.3%
3. 3 Number of sessions 2,034 2,165 2,031 2,055 8,285
  % within group 10.1% 10.6% 10.1% 9.9% 10.2%
4. 4–5 Number of sessions 1,552 1,591 1,539 1,457 6,139
  % within group 7.7% 7.8% 7.7% 7.0% 7.6%
5. 6–10 Number of sessions 687 736 611 616 2,650
  % within group 3.4% 3.6% 3.0% 3.0% 3.3%
6. 11–385 Number of sessions 81 135 121 104 441
  % within group 0.4% 0.7% 0.6% 0.5% 0.5%
Total Number of sessions 20,069 20,339 20,073 20,824 81,305
  % within group 100.0% 100.0% 100.0% 100.0% 100.0%

If the categorization groups 1 and 2 are combined (1–2 interactions) and compared with categorization groups 3, 4, 5, and 6 combined (3 or more interactions), then the number of sessions is distributed as in table 4.

Table 4: Cross-tabulation of number of sessions in two categories by personalization group.

Number of
interactions
  33% 67% 100% Reference Total
1–2 Number of sessions 15,715 15,712 15,771 16,592 63,790
  % within group 78.3% 77.3% 78.6% 79.7% 78.5%
3 or more Number of sessions 4,354 4,627 4,302 4,232 17,515
  % within group 21.7% 22.7% 21.4% 20.3% 21.5%
Total Number of sessions 20,069 20,339 20,073 20,824 81,305
  % within group 100.0% 100.0% 100.0% 100.0% 100.0%

In the personalized groups it was between 5.5% and 11.9% more sessions that belonged to the “3 or more interactions” than the reference group. The personalized groups had a dislocation towards more interactions per session than the reference group.

The overall effect of personalization on the increase of average interactions per session was thus statistically significant by all of the comparisons made (parametric, non-parametric, and categorical).

6.4       Turning off the personalization on the first page

After the test, the personalization on only the first page was turned off because of internal maintenance at SvD and the results of the personalized groups compared to the non-personalized group dropped significantly.

7.         Conclusions and discussion

Due to the short test period (one week) the results should be interpreted carefully and treated only as indicative.

The test indicates that personalization works.

It appears as if a medium to high session-consideration is preferable for obtaining the business objective of increased traffic volume to the website. The configuration with greater consideration of the user’s history only performed slighter better than the non-personalized group. All of the personalized groups had a dislocation towards more interactions per session than the reference group.

It appears as if placement on the web page is an important aspect of successful personalization; if a personalized function is well exposed it seems like it has a significant impact on the increase of reading volume.

7.1       Possible problems and errors

7.1.1   The history of the user profiles

The user profiles originated from the beginning of the implementation and were not erased at the start of the test. My interpretation is that this was positive for the test, as it would have been difficult to test the impact of history with only one-week-old user profile data.

7.1.2   Metadata

Possible problems with metadata coding:

1)         Missing metadata. On occasion, the operators (journalists) did not encode the articles with metadata. Missing articles are estimated to be less than 0.5% of the total number of articles.

2)         The possibility that the operators have coded articles wrongly or inconsistently.

3)         Inefficient or erroneous structure of the metadata with regards to the business objectives.

The level of incorrectness in personalization due to erroneous metadata has not been taken into account.

7.1.3   Known technical disturbances

During the test period the communication between the personalization system and the SvD website was lost for a few hours on 11/21/2001 due to technical disturbances from the systems surrounding the SvD website. The impact of these disturbances is considered minor.

7.1.4   Cookie identification

One problem with anonymous users is that it is the computer that is identified by a cookie and not the individual user. This means that the personalization is limited to a certain computer. Even on the same computer a cookie can be lost to deletion and thereby limit the effects of the personalization. If a computer has multiple users, the personalization will be an average of the behavior of the multiple users. If a single user uses multiple computers, the personalization system will treat the different computers as different users.

7.1.5   Search robots

No considerations of the effects on the results by search robots have been made.

7.2       Future works

Further investigations into the selection of personalization functions, where they should be placed, how to optimize the configuration of the personalization system, and finding an optimal metadata structure are needed both in the news-domain and in other types of domains. Additional longitudinal tests to confirm the effects of personalization may be suitable.

8.         Acknowledgments

This paper would not have been possible without the participation of all the employees at Adaptlogic AB and Johan Möller and Klas Sabelström at Svenska Dagbladet AB.

The statistical tests in the chapter Statistical significance was carried out with the assistance from Harald Janson (PhD., Department of Psychology, The University of Oslo (UiO)).

References

  • Chesnais, P. R., Mucklo, M. J., Sheena, J. A. (1995) “The Fishwrap Personalized News System” In the proceedings of the IEEE 2nd International Workshop on Community Networking Integrating Multimedia Services to the Home, Princetown, NJ.
  • Foltz, P.W. and Dumais, S.T. (1992) “Personalized Information Delivery: An Analysis of Information Filtering Methods.” Communications of the ACM, Vol 35, No.12, pp51-60.
  • Jokela, S., Turpeinen, M., Kurki, T., Savia, E., Sulonen, R. 2001. The Role of Structured Content in a Personalized News Service. Proceedings of the 34th Hawaii International Conference on System Sciences (CD/ROM), January 3–6 2001, Computer Society Press. 12 pages.
  • Karben, A. (2003) “Recent Modifications. Most Recent Changes to the NITF DTD”, [online], News Industry Text Format, http://www.nitf.org/site/recent-modifications.html
  • KIA – Kommitén för Internetannonsering (2003) Definitioner. http://www.mediacom.it-norr.se/23/kia/final_web/kommentarer.asp (Accessed 8 December 2003).
  • Kristol, D., Montulli, L., (1997) “HTTP State Management Mechanism.” Bell Laboratories, Lucent Technologies, Netscape Communications. RFC 2109.
  • Lassila, O. (1997) “Introduction to RDF Metadata”. W3C – World Wide Web Consortium. http://www.w3c.org/TR/NOTE-rdf-simple-intro-971113.html (Accessed 8 December 2003).
  • Lennstrand, B., Persson, C., Wallin, E., (1999) “Design and Prototyping of a Web-based System for Personalized E-Commerce and Consumer Behavior Research: The Sport Planet Case.” Presented at COTIM, University of Rhode Island, USA.
  • Peppers, D., Rogers, M., and Dorf, B. (1999) “The OneToOne Fieldbook: the complete toolkit for implementing a 1to1 marketing program.” ISBN 0-385-49369-X. Doubleday, New York.
  • RedSheriff (2003). http://www.redsheriff.com (Accessed 8 December 2003).
  • Russel, S., Norvig, P., (1995) “Artificial Intelligence; A Modern Approach.” ISBN 0-13-360124-2. Prentice Hall, Inc., New Jersey, USA.
  • Saunders, C. (2002) “NYTD Points to ‘Surround Session’ Benefits”. internetnews.com. http://www.internetnews.com/IAR/article.php/12_970991 (Accessed 8 December 2003).
  • Stadler, H. Lindgren, J. (2003) “TT NITF version 3.3“. Tidningarnas Telegrambyrå. http://www.tt.se/tekspec/ttnitfv3_3.doc (Accessed 8 December 2003).
  • SvD (2001). Svenska Dagbladet. http://www.svd.se (Accessed 8 December 2003).
  • Turpeinen, M., Saarela, J., Korkea-aho, M., Puskala, T., Sulonen, R. 1996. Architecture for Agent-Mediated Personalised News Services. PAAM’96: The First International Conference on Practical Application of Intelligent Agents and Multi-Agent Technology, 22–24 April, London, UK.
  • Wallin, E., Persson, C., (1998) “Using autonomous agents in Internet marketing; A discussion on subjective product characteristics in agent assisted trade” in Roger, J. P., et al (eds.), Technologies for the Information Society; Developments and Opportunities. IOS Press, Amsterdam, pp225–232.
  • Wallin, E., (1999) “Consumer personalization technologies for e-commerce on the Internet; A taxonomy.” in Roger, J. P., et al. (eds.), Business and work in the Information Society; New technologies and applications, IOS Press, Amsterdam, pp237–242.
  • Wallin, E., Persson, C., Lennstrand, B., (2000) “Web metrics; Design specifications of web-based system for personalization with bifurcation” TAGA 2000 Proceedings, Rochester, NY, USA.
  • Winnick Cluts, N. (1997) “An ASP You Can Grasp: The ABCs of Active Server Pages”. Microsoft Corporation, http://msdn.microsoft.com/library/default.asp?url=/library/en-us/dnasp/html/aspover.asp (Accessed 8 December 2003).
 
Copyright   © Erik Wallin, 2003  

Home Up Papers in this Issue Previous Issues Site Map

EJISE is published by Academic Conferences Limited
Curtis Farm, Kidmore End, Nr Reading RG4 9AY, England
Tel: +44 (0)1189 724148, Fax: +44 (0)1189 724691, Email: info@ejise.com

Send mail to info@academic-conferences.com with questions or comments about this web site.
Copyright © 2002-2006 Electronic Journal of Information Systems Evaluation
Last modified: September 29, 2005
ISSN 1566-6379