Patent application number | Description | Published |
20080256068 | METHOD AND SYSTEM FOR CALCULATING IMPORTANCE OF A BLOCK WITHIN A DISPLAY PAGE - A method and system for identifying the importance of information areas of a display page. An importance system identifies information areas or blocks of a web page. A block of a web page represents an area of the web page that appears to relate to a similar topic. The importance system provides the characteristics or features of a block to an importance function that generates an indication of the importance of that block to its web page. The importance system “learns” the importance function by generating a model based on the features of blocks and the user-specified importance of those blocks. To learn the importance function, the importance system asks users to provide an indication of the importance of blocks of web pages in a collection of web pages. | 10-16-2008 |
20100169300 | Ranking Oriented Query Clustering and Applications - Techniques described herein allow for suggesting creation of tools for improving search engine performance. Specifically, these tools focus on producing more relevant search engine results via a URL-based query clustering method. These tools first extract tokens from Uniform Resource Locators associated to search queries. With these tokens, these tools form query clusters of common tokens. The resulting clusters can be used to help understand the similarities in user search queries via URL-based cluster queries to produce more relevant search results. | 07-01-2010 |
20110191381 | Interactive System for Extracting Data from a Website - Described is a technology for efficiently labeling a webpage. A wrapper tool labels records of a webpage at the record level. If an existing wrapper exists that is appropriate for labeling a record, the wrapper tool automatically labels that record. For unlabeled records, the tool provides a user interface to label those records, and updates the set of existing wrappers with a new wrapper that is generated based upon the labeling operation; the new wrapper is then applied to any unlabeled records if appropriate for those records. As a result, a user typically needs only to label a relatively few records, with the wrappers generated for those records automatically used to label the other unlabeled records of the webpage. | 08-04-2011 |
20110209048 | INTERACTIVE SYNCHRONIZATION OF WEB DATA AND SPREADSHEETS - Interactive synchronization of Web data and spreadsheets is usable to build data wrappers based on any type of data found in a document. Such data wrappers can be used to interact with source documents, crawl a network for additional data, map data from across domains, and/or synchronize data from dynamic Web documents. | 08-25-2011 |
20110238644 | Using Anchor Text With Hyperlink Structures for Web Searches - This document describes tools for adjusting anchor text weight to provide more relevant search engine results. Specifically, these tools take advantage of a site-relationship model to consider relationships not only between an anchor text source site and a destination page but also relationships between multiple anchor text source sites to improve web searches. Consideration of these relationships aids in determining a new an anchor text weight, which in turn results in more relevant search results. | 09-29-2011 |
20120109950 | METHOD AND SYSTEM FOR CALCULATING IMPORTANCE OF A BLOCK WITHIN A DISPLAY PAGE - A method and system for identifying the importance of information areas of a display page. An importance system identifies information areas or blocks of a web page. A block of a web page represents an area of the web page that appears to relate to a similar topic. The importance system provides the characteristics or features of a block to an importance function that generates an indication of the importance of that block to its web page. The importance system “learns” the importance function by generating a model based on the features of blocks and the user-specified importance of those blocks. To learn the importance function, the importance system asks users to provide an indication of the importance of blocks of web pages in a collection of web pages. | 05-03-2012 |
20120212862 | METHOD AND DEVICE FOR LIMITING SECONDARY ARC CURRENT OF EXTRA-HIGH VOLTAGE/ULTRA-HIGH VOLTAGE DOUBLE CIRCUIT LINES ON THE SAME TOWER - A method and a device for limiting secondary arc current of an extra-high voltage/ultra-high voltage double circuit line on the same tower. The method comprises the following steps: determining the type of a single-phase-to-ground fault when the extra-high voltage/ultra-high voltage double circuit line on the same tower has a single-phase-to-ground fault (S | 08-23-2012 |
20130173605 | Extracting Query Dimensions from Search Results - Techniques are described for automatically mining query dimensions from web pages resulting from execution of a search query. Lists of items such as words, terms, or phrases are extracted from the web pages based on the recognition of free text, metadata tag, or repeated region patterns within the web page text. Extracted item lists are weighted according to document matching and/or inverse document frequency, and item lists are clustered based on shared or similar items within the lists to generate query dimensions. The generated query dimensions, and the items within each query dimension, are ranked according to quality, and high-quality query dimensions are provided for display alongside top search results. | 07-04-2013 |
20130257166 | Method, Apparatus and System for Suppressing Low Frequency Oscillation in Power System - A method, apparatus and system for suppressing low frequency oscillation in a power system. The method comprises: determining a system transfer function of an interconnected power system section in which a variable frequency transformer (VFT) is located; determining a damping controller parameters according to the system transfer function; and suppressing low frequency oscillation of the power system by means of the VFT based on the damping controller parameter. The objects of the method, apparatus and system are definite: optimizing the damping controller parameter can be achieved by simply tracking and analyzing the response of the system to disturbance, without the need to understand the configuration and parameters of the system or solve complicated power system equations, which has a better effect in suppressing low frequency oscillation in the power system and is advantageous for improving the safety and stability level of the power grid. | 10-03-2013 |
20140207746 | Adaptive Query Suggestion - When a user-submitted query is received, a set of candidate queries is identified. For each of the candidate queries, features are extracted that, for each candidate query, reflect a measure of effectiveness of the candidate query. The candidate queries are rank ordered based on the measure of effectiveness, and one or more of the top-ranked candidate queries are presented as suggested alternatives to the user-submitted query. | 07-24-2014 |
20150063696 | DETERMINING IMAGES OF ARTICLE FOR EXTRACTION - A content application determines images of an article for extraction. The content application identifies an initial image associated with a content of the article. A caption and a credit line associated with the initial image is detected and the initial image is extracted along with the caption and the credit line. A second image of the article associated with a video is also detected and extracted along with the video. In addition, the content application extracts a slideshow detected within the article. | 03-05-2015 |
20150067476 | TITLE AND BODY EXTRACTION FROM WEB PAGE - Technologies are generally provided for extracting a body and a title of an article displayed on a web page. A web page may display content such as advertisements, images and links in addition to the web page article. A user may select to view the article in a reader application without the additional content, and the reader application may extract the body and the title from the web page. Title candidates may be selected by identifying meta tags associated with the title and removing website names from the meta tags. Body candidates may be selected by identifying clusters of text nodes based on a font size and depth in a document object model tree for the web page. A best cluster that is most likely the body may be selected and a corresponding title candidate maybe selected as the best title. | 03-05-2015 |