Patent application number | Description | Published |
20100185619 | Sampling analysis of search queries - Sampling analysis includes classifying a plurality of query keywords into a plurality of query keyword subsets according to page view (PV) values associated with the plurality of query keywords, the plurality of query keywords being submitted by a plurality of users; determining a respective plurality of sample rates of a respective plurality of query keywords in a respective one of the plurality of query keyword subsets; and sampling query data in the respective one of the plurality of query keyword subsets according to the respective plurality of sample rates. | 07-22-2010 |
20100325105 | Generating ranked search results using linear and nonlinear ranking models - Generating ranked search results includes receiving a plurality of matching information items that match a search request, ranking at least some of the plurality of matching information items using a linear ranking model that linearly combines a first plurality of feature values to obtain a first set of ranked results, ranking at least some of the first set of ranked results using a nonlinear ranking model that nonlinearly combines a second plurality of feature values to obtain a second set of ranked results, and provide a search response based on the second set of ranked results. | 12-23-2010 |
20110016111 | Ranking search results based on word weight - Ranking search results, comprises receiving a query string; retrieving a plurality of search results that include a corresponding plurality of target strings that relate to the query string; segmenting the query string and each of the plurality of target strings; pairing segments in the query string with respective segments in the target strings to form a plurality of combinations; retrieving a plurality of weights that correspond to the plurality of combinations based on a mapping of word combinations and their respective weights, wherein a weight measures semantic correlation between words in a word combination; and determining a weighted word length based on the weights corresponding to each of the plurality of target strings; and ranking the plurality of target strings based on their respective weighted word lengths. Alternatively, ranking search results includes determining a minimum weight of each inserted word with respect to segmented words in the query string; determining a minimum weight of each deleted word with respect to segmented words in the target strings; determining a total edit distance based at least in part on the minimum weight of each inserted word and the minimum weight of each deleted word; and ranking the target strings based on the total edit distances. | 01-20-2011 |
20110047138 | Method and Apparatus for Identifying Synonyms and Using Synonyms to Search - A method and an apparatus for identifying synonym and utilizing such synonym to conduct search is disclosed. The disclosed method includes: obtaining arbitrary two words to be identified; determining whether a shortest edit distance between the two words less than or equal to an edit distance threshold; determining whether the two words to be identified exist in a preset knowledge database, and if an answer is yes then searching a smallest granularity type with highest weight value for each word in the knowledge database; and if the two word have the same smallest granularity type with highest weight value, then determining such two words are synonyms, or non-synonym otherwise. The disclosed techniques greatly improve accuracy of synonym identification and guarantee effect of synonym identification. | 02-24-2011 |
20110060733 | Information retrieval based on semantic patterns of queries - An information retrieval method includes pre-processing a set of historical query information and processing a user query. Pre-processing a set of historical query information includes determining a plurality of semantic patterns based on a plurality of queries in the set of historical query information; establishing correspondence relationships between the plurality of semantic patterns and a plurality of filtering and ranking operations. Processing a user query comprises receiving the user query; retrieve a plurality of results in response to the user query; determining a semantic pattern that corresponds to the user query; determining a set of filtering and ranking operations that corresponds to the semantic pattern based on the correspondence relationships; and performing the set of filtering and ranking operations on the plurality of results to generate a set of filtered and ranked results. | 03-10-2011 |
20110060734 | Method and Apparatus of Knowledge Base Building - The present disclosure provides a method and apparatus of knowledge base building to automatically construct a knowledge base. Furthermore, the disclosed techniques can be used to improve the accuracy of that knowledge base. In one aspect, a method acquires a sentence from a webpage using a basic data processing layer of a computing apparatus. The acquired sentence is parsed into words using a data mining layer of the computing apparatus. One or more representative words in a first category of a knowledge base are matched with the words parsed from the acquired sentence. When there is a match between one of the representative words and one of the words parsed from the acquired sentence, a string of words adjacent the matched word in the acquired sentence is added to the first category as a first entry. When matching the words parsed from the acquired sentence with a second entry of a second category of the knowledge base, it is determined whether or not an established correlation exists between the first category and the second category. When it is determined that an established correlation exists between the first category and the second category, a correlation between the first entry of the first category and the second entry of the second category is established. The present disclosure also discloses methods for searching information and computing apparatuses that implement the methods. | 03-10-2011 |
20110082860 | Search Method, Apparatus and System - The present disclosure describes a search method, a search apparatus and a search system. The method includes: a data rewriting system that obtains, from a database, one or more search term candidates that are relevant to a present search term. The data rewriting system retrieves properties of the present search term and the one or more search term candidates, where the properties describe respective matching results of the present search term and the one or more search term candidates. Based on the matching results, the data rewriting system determines whether or not the present search term needs to be rewritten, and rewrites the present search term based on the matching results to provide a rewritten present search term if it is determined that the present search term needs to be rewritten. A search engine performs a search based on the rewritten present search term. The disclosed method, apparatus and system avoid the approach of conducting a search based on fixed rules after the present search term is rewritten, thus reducing the probability of having ambiguity in the search process and improving the degree of search accuracy. | 04-07-2011 |
20110225180 | Guided search based on query model - Providing guided search includes: receiving a new query; processing the new query to obtain a plurality of models, wherein a model indicates a mapping relationship of a previously stored query and corresponding guidance information; determining a corresponding plurality of similarities of the plurality of models relative to the new query; obtaining guidance information in a database comprising a plurality of mapping relationships of previously stored query and corresponding guidance information, including selecting at least some of the plurality of models based at least in part on the similarities and using the selected models as index to search the database; and sending the obtained guidance information to be displayed to a user. | 09-15-2011 |
20110295650 | Analyzing merchandise information for messiness - Analyzing merchandise information includes: receiving merchandise information input by a user; analyzing the merchandise information, including at least obtaining values corresponding to one or more characteristic attributes from the merchandise information, wherein the values corresponding to one or more characteristic attributes are used to determine whether the merchandise information is messy; determining a messiness confidence level associated with the merchandise information based at least in part on the obtained values corresponding to one or more characteristic attributes; and determining whether the messiness confidence level associated with the merchandise information exceeds a preset threshold value; in the event that the messiness confidence level exceeds the preset threshold value, sending an indication to stop publication of the merchandise information and in the event that the messiness confidence level does not exceed the preset threshold value, not sending an indication to stop publication of the merchandise information. | 12-01-2011 |
20110307839 | Implementing website navigation - Implementing website navigation includes deriving a confidence level for at least one leaf node of a website navigation category diagram using historical user operation data; generating one or more navigation hierarchical structure diagrams based on the website navigation category diagram; determining a searching cost associated with each of the one or more navigation hierarchical structure diagrams; determining the navigation hierarchical structure diagram associated with the lowest searching cost; and implementing at least in part a website navigation mode using the navigation hierarchical structure diagram associated with the lowest searching cost. | 12-15-2011 |
20120047148 | Method for Generating Search Result and System for Information Search - The present disclosure discloses a method for generating a search result and an information search system. The method for generating a search result includes: receiving, by an information search system, a search request; obtaining, by searching, a plurality of pieces of matching information that match the search request; obtaining a respective amount of user response associated with each of the plurality of pieces of matching information and further obtaining a total amount of user response associated with a respective categories to which each of the plurality of pieces of matching information belongs; and ranking the plurality of pieces of information to generate a search result based on the total amount of user response associated with the respective category to which each of the plurality of pieces of matching information belongs. By using the above technical scheme, a result of more rational ranking of matching information can be displayed to a user when the user performs a search, thus improving experience of the user. | 02-23-2012 |
20120330962 | Method and Apparatus of Providing Suggested Terms - The present disclosure discloses a method of providing suggested terms. The method includes: receiving an initial query input from a user, and obtaining corresponding suggested queries based on the initial query; determining at least two categories corresponding to the suggested queries and at least two clickable regions usable for looking up the suggested queries; separately determining a category weight associated with each obtained category in each clickable region for the suggested queries, and a click attribute weight associated with each clickable region; computing a degree of confidence of each category for the suggested queries; and separately determining target categories for the suggested queries based on the degree of confidence of each category for the suggested queries. As such, the user may quickly identify his/her search intention based on the target categories corresponding to the suggested queries, thereby effectively improving the speed of information searching. | 12-27-2012 |
20130132363 | METHOD AND APPARATUS FOR IDENTIFYING SYNONYMS AND USING SYNONYMS TO SEARCH - A method and an apparatus for identifying synonym and utilizing such synonym to conduct search is disclosed. The disclosed method includes: obtaining arbitrary two words to be identified; determining whether a shortest edit distance between the two words less than or equal to an edit distance threshold; determining whether the two words to be identified exist in a preset knowledge database, and if an answer is yes then searching a smallest granularity type with highest weight value for each word in the knowledge database; and if the two word have the same smallest granularity type with highest weight value, then determining such two words are synonyms, or non-synonym otherwise. The disclosed techniques greatly improve accuracy of synonym identification and guarantee effect of synonym identification. | 05-23-2013 |
20130166544 | GENERATING RANKED SEARCH RESULTS USING LINEAR AND NONLINEAR RANKING MODELS - Generating ranked search results includes receiving a plurality of matching information items that match a search request, ranking at least some of the plurality of matching information items using a linear ranking model that linearly combines a first plurality of feature values to obtain a first set of ranked results, ranking at least some of the first set of ranked results using a nonlinear ranking model that nonlinearly combines a second plurality of feature values to obtain a second set of ranked results, and provide a search response based on the second set of ranked results. | 06-27-2013 |
20130191381 | METHOD AND APPARATUS OF ORDERING SEARCH RESULTS - Ordering search results may include obtaining an exposed log file from a log system, computing a Bayesian posterior probability for relevancy between the log file and a search request, computing an expected value of the relevancy between the log file and the search request based on the Bayesian posterior probability, storing the search request and an identifier of the log file as a key and the expected value of the relevancy between the log file and the search request as a value into a search data structure, in response to receiving a search request submitted by a user, finding expected values of relevancy between the submitted search request and log files that are relevant to the submitted search request from the search data structure, and ordering the found log files in a descending order of the expected values. | 07-25-2013 |
20130325857 | Method and Apparatus of Ordering Search Results - Ordering search results may include obtaining an exposed log file from a log system, computing a Bayesian posterior probability for relevancy between the log file and a search request, computing an expected value of the relevancy between the log file and the search request based on the Bayesian posterior probability, storing the search request and an identifier of the log file as a key and the expected value of the relevancy between the log file and the search request as a value into a search data structure, in response to receiving a search request submitted by a user, finding expected values of relevancy between the submitted search request and log files that are relevant to the submitted search request from the search data structure, and ordering the found log files in a descending order of the expected values. | 12-05-2013 |
20140258844 | SELECTING A NAVIGATION HIERARCHICAL STRUCTURE DIAGRAM FOR WEBSITE NAVIGATION - Implementing website navigation includes deriving a confidence level for at least one leaf node of a website navigation category diagram using historical user operation data; generating one or more navigation hierarchical structure diagrams based on the website navigation category diagram; determining a searching cost associated with each of the one or more navigation hierarchical structure diagrams; determining the navigation hierarchical structure diagram associated with the lowest searching cost; and implementing at least in part a website navigation mode using the navigation hierarchical structure diagram associated with the lowest searching cost. | 09-11-2014 |
20140351246 | GENERATING RANKED SEARCH RESULTS USING LINEAR AND NONLINEAR RANKING MODELS - Generating ranked search results includes receiving a plurality of matching information items that match a search request, ranking at least some of the plurality of matching information items using a linear ranking model that linearly combines a first plurality of feature values to obtain a first set of ranked results, ranking at least some of the first set of ranked results using a nonlinear ranking model that nonlinearly combines a second plurality of feature values to obtain a second set of ranked results, and provide a search response based on the second set of ranked results. | 11-27-2014 |
20150039604 | Method and Apparatus of Ordering Search Results - Ordering search results may include obtaining an exposed log file from a log system, computing a Bayesian posterior probability for relevancy between the log file and a search request, computing an expected value of the relevancy between the log file and the search request based on the Bayesian posterior probability, storing the search request and an identifier of the log file as a key and the expected value of the relevancy between the log file and the search request as a value into a search data structure, in response to receiving a search request submitted by a user, finding expected values of relevancy between the submitted search request and log files that are relevant to the submitted search request from the search data structure, and ordering the found log files in a descending order of the expected values. | 02-05-2015 |
20150074076 | SEARCH METHOD, APPARATUS AND SYSTEM - The present disclosure describes a search method, a search apparatus and a search system. The method includes: a data rewriting system that obtains, from a database, one or more search term candidates that are relevant to a present search term. The data rewriting system retrieves properties of the present search term and the one or more search term candidates, where the properties describe respective matching results of the present search term and the one or more search term candidates. Based on the matching results, the data rewriting system determines whether or not the present search term needs to be rewritten, and rewrites the present search term based on the matching results to provide a rewritten present search term if it is determined that the present search term needs to be rewritten. A search engine performs a search based on the rewritten present search term. The disclosed method, apparatus and system avoid the approach of conducting a search based on fixed rules after the present search term is rewritten, thus reducing the probability of having ambiguity in the search process and improving the degree of search accuracy. | 03-12-2015 |
20150081683 | RANKING SEARCH RESULTS BASED ON WORD WEIGHT - Ranking search results, comprises retrieving search results that include target strings that relate to a query string; segmenting the query string and each of the target strings; pairing segments in the query string with respective segments in the target strings to form combinations; retrieving weights that correspond to the combinations; and determining a weighted word length based on the weights corresponding to each of the target strings; and ranking the target strings based on their respective weighted word lengths. Alternatively, ranking search results includes determining a minimum weight of each inserted word with respect to segments in the query string; determining a minimum weight of each deleted word with respect to segments in the target strings; determining a total edit distance for each target string; and ranking the target strings based on the total edit distances. | 03-19-2015 |