United States patent 5,983,227 nazem, et al. November 9, 1999
Dynamic Page Generator
Abstract
An custom page server is provided with user preferences organized into templates stored in compact data structures and the live data used to fill the templates stored local to the page server which is handing user requests for custom pages. One process is executed on the page server for every request. The process is provided a user template for the user making the request, where the user template is either generated from user preferences or retrieved from a cache of recently used user templates. Each user process is provided access to a large region of shared memory which contains all of the live data needed to fill any user template. Typically, the pages served are news pages, giving the user a custom selection of stock quotes, news headlines, sports scores, weather, and the like. With the live data Stored in a Local, Shared Memory, Any Custom Page Can Be Built Wtem, Eliminating the Need To Make Requests from Other Servers for Portions of the Live Data. While The SHA Red Memory Might Include Ram (Random Access Memory) and Disk Storage, In Many Computer Systems, IT IS Faster To Store All The Live Data In Ram.
Inventors: Nazem; Farzad (Redwood City, CA); Patel; ASHVINKUMAR P (Milpitas, CA) Assigne: Yahoo, Inc. (Santa Clara, CA) Appl. NO. No .: 873975Filed: June 12, 1997
CURRENT US Class: 707/10; 705/1; 705/10; 707/104.1; 709/217; 715/500; 715/513; 715/517 Intern'l Class: G06F 017 / 30Field of Search: 707/10, 104, 200, 500, 513, 517 395 / 200.47,200.48,200.49 705 / 1,10
References Cited [Reference By]
U.S. Patent Documents
5754938May., 1998Herz et al.455 / 4. 5754939May., 1998Hertz et al.455 / 4. 5761662Jun., 1998Dasan707 / 10. 5793497Aug., 1998Funk358 / 402. 5793972Aug., 1998Shane395 / 200. 5835087Nov., 1998Herz et al. 345/327.
Garris, John, "Grab That Database", PC Magazine, Vol. 15, No. 15, Sep. 10, 1996, P. NE1-Ne7. Chesnais et al., "The Fishwrap Personalized News System", IEEE, D- 7803-27560X / 95, Jun., 1995, PP. 275-282. Yuri Quintana, University of Western Ontario, "Knowledge-based Information Filtering of Financial Information," XP-002057953, May, 1997, PP. 279-285.
PRIMARY EXAMINER: LINTZ; Paul R.
ATTORNEY, Agent OR Firm: Albert; Philip H. Townsend and Townsend and Crew LLP
Claims
What is claimed is:
1. A Server System for Serving a Customized Information Page to a USER AT A Browser On Request, Comprising:
A Database of User Configurations;
Means for obtaining information from the user refresting a record in the database of user configurations, the record being a record for the user;
Means for generating a template based on the record for the user;
A Plurality of User Processes, One Process Per User Accessing The Server System;
A Shared Memory;
A Data Structure With, a Data Structure for Storing The Dynamic Information Needed To Generate;
Response mens for responding to a request with an information page contact Dynamic Information from the data structure formatted agent;
configuration generating means for generating default user configurations based on user demographic information, including means for determining a default list of cities for a weather report and sports teams for sports reporting based on the user demographic information, wherein the means for determining the default list of cities And default lists of sports teams uses user postal code information; means for translating postal code information to user geographic position;
Means for Comparing the User Geographic Position To Geographic Positions Assigned to Each City and Sports Team; And Sports Team;
means for determining a threshold distance from the user geographic position that is greater than or equal to a distance to a predetermined nonzero number of cities and a predetermined nonzero number of sports team geographic positions.
2. USING A Page Server, A Method of Providing Real-Time Response User Requests for Customized Pages, The Method Comprising The Steps of:
Obtaining User Preferences, Wherein a User's User Preferences Indicate Items of Interest To That User;
Obtaining real-time information from information.
Storing The Real-Time Information in A Storage Device;
Combining the user preferences for the user and a template to form a template program specific to the user;
Receiving, from a user and at the server, a user request for a customer oming to the user preferences
EXECUTINTING THE User Uns
providing the user with the customized page, wherein the steps of executing and providing are performed in real-time response to receipt of the user request in the step of receiving and wherein the customized page includes at least one item of real-time information selected from The Storage Device.3. The Method of Claim 2, Wherein The Real-Time Information Comprises Stock Quotes, Sports Scores and news headlines.
4. The method of claim 2, wherein the step of storing the real-time information in a storage device is a step of storing the real-time information in a memory having a capacity to simultaneously contain all of the real-time information that could Be Required for Execution of the Template Program.
5. The method of claim 2, wherein the server simultaneously handles more than one thousand user requests using a plurality of servers, the method further comprising a step of assigning a user request to one of the plurality of servers.
6. The Method of Claim 2, Further Comprising A Step of Template Program for Users for Whom User Requests Have Been Recently Serviced.
7. The Method of Claim 2, Further Comprising a Step of Generating a Default User Configuration for a User Based on Demographic Information of That User.
8. The Method of Claim 7, Wherein The Step of Generating A Default User Configuration Comprises The Steps of:
DETERMINING A Default List of Cities for a Weather Report Based ON User Demographic Information; And
Determining One or More Sports Teams for Sports Reporting Based ON User Demographic Information.
9. The Method of Claim 8, Wherein The Steps of Determining Comprise The Steps of:
Obtaining User Postal Code Information;
TRANSLATING THE POSTAL CODE Information;
comparing the user geographic position to geographic positions assigned to each city and team sports; anddetermining a threshold distance from the user geographic position which is greater than or equal to a distance to a predetermined nonzero number of cities and a predetermined nonzero number of sports team geographic Positions.
Description
Copyright Notice
A portion of the disclosure of this patent document contains material which is subject to copyright protection. The copyright owner has no objection to the xerographic reproduction by anyone of the patent document or the patent disclosure in exactly the form it appears in the Patent and Trademark Office Patent File or Records, But Otherwise Reserves All Copyright Rights Whatsoever.
Background of the invention
The present invention relates to the field of customized information presentation. More specifically, one embodiment of the invention provides a custom page server which can quickly serve custom pages and is scalable to handle many users simultaneously.
Web servers for serving static documents ( "Web pages") over the global Internet are known. While static documents are useful in many applications where the information to be presented to each requesting user is the same, some applications require customization to appeal to users. For example, in presenting news to users, custom pages present news which is more relevant to the requesting users than static pages. With static pages, a user will often have to scroll through many topics not of interest to that user to get to the information Of Interest. With Custom Pages, The Information is Filtered According to Each User's Interest.
Customizing a server response based on the requester is known, however known systems do not scale well. One method of serving custom pages is to execute a script, such as a CGI (Common Gateway Interface) script, or other program to collect the information necessary to generate the custom page. For example, if the custom page is a news page containing stock quotes, sports scores and weather, the script might poll a quote server to obtain the quotes of interest, poll a sports score server to obtain the scores of interest and poll a weather server to obtain the weather. With this information, the server generates the custom page and returns it to the user. This approach is useful where there are not many requesters and where the attendant delay is acceptable to users. While it may be the case that current users are willing to wait while pages load in their browsers, growing impatience with waiting will turn users away from such servers, especially as use increases.One approach to avoiding long waits is to transfer the custom information in non-real-time, so that the information is stored local to the user as it arrives and is presented to the user on request. A disadvantage of such a system is that the networks used by the user become clogged with data continually streaming to the user and require large amounts of local storage. Another disadvantage is that the locally stored information will become out of date as the server receives new data.
From the above it is seen what an improved system for delivergeling customer
Summary of the invention
An improved custom page server is provided by virtue of the present invention. In one embodiment, user preferences are organized into templates stored in compact data structures and the live data used to fill the templates is stored local to the page server which is handing user requests for custom pages. One process is executed on the page server for every request. The process is provided a user template for the user making the request, where the user template is either generated from user preferences or retrieved from a cache of recently used user templates . Each user process is provided access to a large region of shared memory which contains all of the live data needed to fill any user template. Typically, the pages served are news pages, giving the user a custom selection of stock quotes, news headlines, Sports Scores, WEATHER, AND THE LIKE. WITH The Live Data Stored in a Local, Shared Memory, Any Custom Page Can Be Built With Ued To Make Reques ts from other servers for portions of the live data. While the shared memory might include RAM (random access memory) and disk storage, in many computer systems, it is faster to store all the live data in RAM.If the volume of requests becomes too great for one page server to handle, the system is easily scaled by adding additional page servers. Each page server maintains its own copy of the live data in its shared memory, and needs to maintain only the user templates for the requests it is handling SO No Communication Between Page Servers is needed.
............................. ...CRIPLITRIFPHTRISTRATION STUDIES. SIGE.
Brief Description of the Drawingsfig. 1 IS A Block Diagram of a Client-Server Architecture According To One Embodiment of the Present Invention;
Fig. 2 IS a Schematic Diagram Showing How A User's Custom Page IS generated, A Global Template and Live Data.
Fig. 3 IS An Illustration of A Global Front Page Template AS Might Be Used To Generate User Templates.
.
Fig. 5 is an illustration of a user page generated using the global template of Fig. 4.
Fig. 6 is an illustration of how intelligent defaults are selected.
Two appendices are included at the end of this description. Appendix A is a full listing of the user template shown in part in FIG. 4. Appendix B is an HTML source code listing of the HTML page used to generate the browser display shown in FIG .
Description of the Preferred Embodiments
FIG. 1 shows a client-server system 100 which is used to display custom news pages. A custom news page is displayed on a browser 102 which obtains the page from a page server 104 via Internet 106. While only one browser 102 is shown, A Typical System Will Have Many Browsers Connecting and disconnecting to the system.
The art of client-server systems using HTTP (HyperText Transport Protocol) messaging or other protocols is well known and will not be addressed in detail here. Essentially, browser 102 makes a request for a particular page by specifying a Uniform Resource Locator ( "URL ") for the need show. in the example........................................... yahoo.com. As is the convention in Internet communications, browser 102 submits the domain portion ( "my.yahoo.com") of the URL to a name server, such as name server 108, to determine an actual address for the page server 104. name server 108 returns an IP (Internet Protocol) address to browser 102 directing it to a page server 104. Where multiple page servers 104 are used, name server 108 returns IP addresses in a round-robin fashion to distribute the load over multiple Page Servers. Alternative, Name Server 108 Might Distribute the loading mor e deterministic by tracking browser addresses and hashing the browser address to select a page server 104. It is deterministic in that any given browser always accesses the same page server 104. This allows for more efficient caching of user templates, since more cache hits are likely where a given browser always returns to one page server.When a page server receives the URL for its root directory, it interprets that as a request for the user's custom summary page. The user is determined not from the URL, but from a "cookie .
Page servers 104 obtain the live data from many disparate sources and reformat the data into a form suitable for use by the page server. Page servers 104 are coupled, via a network, to edit servers 112, which are used when a user changes his or her user template. The user templates are stored in a user configuration database 116 and are stored and provided to edit servers by a network appliance 114 written for this purpose. Network appliance 114 is a process tuned to quickly locate files in large directories (N400 files / directory) and return them to the edit servers or page servers. One embodiment of network appliance 114 is the F330 fault-tolerant scalable server supplied by Network Appliance, of Mountain View, Calif.In a specific embodiment, page servers 104 are microcomputers running The UNIX.rtm. Operating System With 64 To 128 Megabytes of Shared Memory, Page Servers 104 And Edit Servers 112 Are Conpled Using TCP / IP (Transport Control Protocol / Internet Protocol) and The User Confi guration database 116 is a Unix file structure which stores each user configuration in a text file. The particular file used by a user is a combination of the user's user name and a hash result, to allow for quick access when many user configurations are stored. For Example, The User Configuration for Summary "Front" page for a user "ASH802" might be stored at /de/13/y.ash802, where "de" AND "13" Are Hash Results of a hash of the user name " ASH802 ".
FIG. 2 shows in more detail the generation of a custom page for a user, using a front page generator 200 and page server 104. Front page generator 200 generates a user template 202 from a global front page template 204 and a user configuration record 206 . FIG. 3 shows an example of a global front page template. User configuration record 206 is a record selected from user configuration database 116. The record might have been obtained from a cache, but in the preferred embodiment, the records are not cached, the user templates are.Page server 104 is shown comprising a page generator 210, a shared memory 212 for storing live data and a cache 214 for caching user templates such as user template 202. Page generator 210 generates a custom front page 218 from a user template and the live data stored in shared memory 212. Although not shown, custom pages other than the front page can be generated in a similar fashion. Using user templates and a shared memory for the live data, page server 104 CAN Quickly Build Custom Pages in response to a user request. where the user template is cached, The page can be generated entirely with page server 104.
Shared memory 212 is organized as a set of memory mapped files. With memory mapped files, the operating system maintains the data in permanent storage, but permanently caches the files in shared memory 212. This allows for quick recovery should a page server crash, since all of the shared memory can simply be reloaded from the permanent storage. This is a great feature from a user convenience standpoint, since the user will never be faced with a situation where they have to wait for a server to rebuild a page for them by querying the various data providing servers, such as sports server 230, stock server 232 and news server 234. As should be apparent from this description and FIG. 2, page generator 210 can generate custom front page 218 much more quickly using shared memory 212 as Compared with using servers 230, 232, 234 and page Template 202. One Reason for this Is That Time It Takes To Retrieve Data from Shared Memory 212 Does Not Appreciably Increase Revel To The Bandwidth D elay time when more data is retrieved. For example, if stock server 232 were queried for each individual stock quote, a page with fifty stock quotes might take ten times as long to generate as a page with five stock quotes.
One aspect of the present invention is the realization that every piece of information a person can request on a page is storable in a shared memory closely coupled to a page generator. For example, page server 104 shown in FIG. 2 can accommodate 65,000 different symbols for which quotes are provided. in one embodiment, all of the stock information for all 65,000 symbols is stored in a 13 to 14 megabytes section of the shared memory. Where shared memory 212 is a 64 MB or 128 MB memory, this leaves sufficient room for other data, such as news headlines, sport scores, and memory used by the operating system for each process running on page server 104. In some embodiments, shared memory 212 is large enough to also accommodate more than just news headlines. for example, ..........................
As shown in FIG. 2, the user's front page template 202 does not need to be generated each time, but rather is stored in cache 214. In a preferred embodiment, user templates are stored in cache 214 for long enough to be reused. Some users might choose to access their front page only infrequently, while others might choose to access their front page hourly. Since the pages are customized and dynamic, the user would see different information each time, but the same user template would be used each time. of course, when the user edits his or her template, any cached copy of a user template is flushed. A garbage-collection process may also flush the cache of user pages which have been inactive for several days. In one implementation, cache 214 would accommodate 60,000 to 70,000 user templates. Where a particular page server is assigned on a random round robin basis, multiple page servers may cache their own copy of a given user template, but where a user is directed always to a particular s erver (except in the case where the particular server fails and a secondary server is used), that page server will be the only one which needs to cache that users user template. Even where the round robin name server scheme is used, some browsers may cache IP addresses, even longer than the specified "time to live" variable associated with the IP address, in order to save the time required to obtain an IP address each time. with such a browser, the user is effectively directed to the same page Server Each Time and The Server Side of the page Serving system does not need to direct users to particular page servers, however, the "time to live"
variable is honored and new requests are made for IP addresses after the "time to live" has expired. In these cases, if the assignment of a user to a single page server is desired, name server 108 (see FIG. 1) will use the user name from the provided cookie or the user's IP address to assign a page server based on a hash of the user name or IP address.FIG. 3 is an illustration of global user template 204. Global user template 204 is an HTML (HyperText Markup Language) Document With Additional Tags as PlaceHolders for Live Data. Several PlaceHolders 302 Are Shown In Fig. 3.
FIG. 4 is an illustration of user template 202 as might be generated from global user template 204 (see FIG. 3) and a user configuration record 206. A full listing of user template 202 is included herewith in Appendix A. User template 202 is determined by the user configuration and is independent of the live data, therefore it can be cached without needing to be updated, unless the user chooses to edit the configuration information. Preferably, the user templates are cached rather than the user configuration, to save a step and reduce the time to respond to a request for the page. Caching is more effective where the typical user makes several requests in a short time span and then does not make any requests for a long period of time.
Essentially, user template 202 contains the information about the user which does not change until the user changes his or her preferences. Of course, the system operator could choose to make changes to how the system operates, thus requiring changes to the user preferences and user templates. User template 202 is shown comprising internal variables such as a time zone and demographic information. The demographic information, on the second line in FIG. 4 is used for selection of an advertisement which will be part of the custom page. in this example , the advertisement is targeted by the demographic information in the user template ": M, 85,95035, T, *" indicating that a suitable ad should be targeted to a male user, age 85, located in zip code 95035, etc. As shown, the portfolio section contains selected stock symbols, the scoreboard section contains selected team symbols, and the weather section contains selected weather cities / zip codes.The selections of stock quote symbols, team scores, and weather cities are set by the user. In a preferred embodiment, intelligent defaults are selected by the system prior to user selection, so that users unfamiliar with the customization process will nonetheless be able to view non empty custom pages. This is described in further Detail Below in Connection with Fig. 6.
FIG. 5 is an illustration of a user front page 218 returned by page server 104. User front page 218 as shown in FIG. 5 includes many elements, some of which are described here in further detail. User front page 218 is built according to a user template and live data. The user template specifies, for example which quotes are shown in the portfolio module, which cities are displayed in the weather module, etc. Each of the modules 504 can be customized by a user and moved about front page 218. The modules 504 are also reusable, in that any customized module which appears on multiple pages can be edited from any one of those pages and the edits will be reflected on each of the pages. Other custom pages for the user can be viewed by selecting one of the page buttons 502 appearing below the header. Other pages and utilities can be selected using the buttons 508 which are part of the header.In addition to all of the live date shown in FIG. 5 being stored in the shared memory, Summaries from e ach of the major news topics can also be stored in the shared memory and viewed by pressing on the news topic header, such as news topic header 506. As should be noted, the page generator can also intelligently display dates 510 customized for a particular user , using a time zone variable in the user template. This time zone variable is shown as the first line in user template 202 in FIG. 4. in addition to being able to modify each of the modules, in many cases the order of appearance of ........................................
The preference editing process can be initiated by the user pressing the appropriate edit button 512. As explained above, once the editing process is complete, the user template is flushed from the cache and regenerated. Since each of the news stories is essentially a static page LINKED TO A Headline Shown in the news
Referring now to FIG. 6, an illustration of intelligent defaulting for populating a user template, and consequently a user summary page. As part of a registration process, a user indicates, among other things, his or her zip code. This zip code is used to locate an approximate longitude and latitude for the user using a zip code lookup table 602. This allows the user's location to be located on a map 604. Map 604 provides city boundaries and, with team location table 606, also provides locations for various sports teams which can be selected in a sports module. in selecting a default predetermined number of cities and sports teams for inclusion as initial selections for a particular user, a circle is drawn around the user and increased in diameter until the circle envelopes a suitable predetermined Number of Cities and Sports Teams. in this way interested in only the teams nearest the user, the system can be arranged to provide intelligent defaults where geographic anomalies are known to exist. Geographic anomalies occur in communities which have more loyalty to distant teams than nearby teams, such as might occur when the distant team Is Much Better THEN THE NEARBY TEAM or WHEN THEARBY TEAM Recently Moved to a distant location. in Any Case, The User Is Allowed Customize His Or Her Pages Beginning with The Intelligent Defaults SELECTED.
Other intelligent defaults can be provided in other contexts. For example, the header of user front page 218 includes a button 508 labelled "myweb" which, when pressed, would lead the user to a custom listing of web sites. The initial defaults for that custom listing of web sites might be generated based on the keywords of interest to that user or based on the news topics, sports teams or weather cities selected by the user.The above description is illustrative and not restrictive. Many variations of the invention will become apparent to those of skill in the art upon review of this disclosure. The scope of the invention should, therefore, be determined not with reference to the above description, but instead should be determined with reference to the appended claims along with their full scope of Equivalents.
Appendix a
______________________________________________________________
Listing of a User Template
______________________________________________________________
Portfolio: Quotes, Pf.sub. - 1, 1, DJI, NYA, IXIC, SPX, XAX, Yhoo, NSCP, IOM, NSC
P, yhoo
2836->
Scoreboard: NCAAFSSC, NHLSAN, MLSSAN, NCAAFSSS, ALOAK, NBAGSW, NFLOAK, NCAAFCCD, NF
LSFO, NLSFO 3803 ->
Mode.SUB. - bar: "front.sub. - Page1",. Hier = news summary% 3AEDIT & .DOM = HTTP:
//my.yahoo.com/
News / summary.html? v 4538 ->
hEAD>