05th week of 2016 patent application highlights part 50 |
Patent application number | Title | Published |
20160034455 | MEDIA OBJECT MAPPING IN A MEDIA RECOMMENDER - A method of matching media objects comprises identifying an unknown media item as being the same media item as one of a plurality of known media items by comparing identifying information of the unknown media item with identifying information of one or more known media items. Comparing the identifying information comprises generating a match score for the unknown media item and at least one of the plurality of known media items by comparing the identifying information and determining that the media items match if the match score exceeds a fuzzy match threshold score. If the match score is below the fuzzy match threshold score but above a weak match threshold score, a crowdsource determination of whether the media items match is generated by receiving from two or more users a user determination of whether the media items match. | 2016-02-04 |
20160034456 | MANAGING CREDIBILITY FOR A QUESTION ANSWERING SYSTEM - A method and system for managing credibility of a set of search results for a search query is disclosed. The method can include determining, by a natural language processing technique configured to analyze a portion of the set of search results and a portion of the search query, a credibility factor configured to indicate similarity to a subject matter of the search query. The method can also include establishing a relevance relationship between the credibility factor and source information of a first search result of the set of search results, wherein the source information is based on the credibility factor. The method may also include computing a credibility score for the first search result of the set of search results based on the relevance relationship between the credibility factor and the source information of the set of search results. | 2016-02-04 |
20160034457 | Changed Answer Notification in a Question and Answer System - Mechanisms are provided, in a question and answer (QA) system, for monitoring changes in answers to previously submitted questions. A question is received for which an answer is to be generated. A first answer for the question is generated and the question, first answer, and a notification party are registered in a question registration data structure. A second answer to the question is generated, at a later time than when the first answer was generated. A degree of a change in the answer to the question is determined. An answer change notification is transmitted to the notification party based on the degree of the change in the answer to the question. | 2016-02-04 |
20160034458 | SPEECH RECOGNITION APPARATUS AND METHOD THEREOF - There is provided a speech recognition controlling method which includes extracting a keyword by crawling a webpage, adding the keyword to a lexicon in which a plurality of words are registered and updating the lexicon, recognizing, in response to a user speech being input, the speech based on the updated lexicon, performing a search according to the recognized result, and displaying a result of the search. | 2016-02-04 |
20160034459 | CURATING MEDIA FROM SOCIAL CONNECTIONS - Media files may be curated for a user based on a location of a client device associated with the user. A user carrying a client device may arrive at a particular location on a given day. A computer-implemented process may include determining whether the given day and/or the particular location is “unique” to the user in some fashion, which may be based on one or more criteria used to quantify or otherwise evaluate uniqueness of the day or location. At least partly in response, a media file sharing service may be queried to identify shared media files that were created at the particular location, wherein the shared media files are associated with one or more entities having social connections to the user. A curated subset of the identified media files may be automatically created and a notification may be issued to the user. | 2016-02-04 |
20160034460 | METHOD AND SYSTEM FOR RANKING MEDIA CONTENTS - A method is provided for ranking media contents. The method includes receiving media contents through a network and extracting feature values of the received media contents. The method also includes implementing a parameter reinforcement learning process to automatically obtain a distribution over relativeness and irrelativeness of the received media contents. Further, the method includes ranking the received media contents by a multi-armed bandit algorithm based on the obtained distribution over relativeness and irrelativeness of the received media contents. | 2016-02-04 |
20160034461 | CONNECTION INSIGHTS WIDGET - Techniques for enhancing search results generated by a search engine are described. A first set of member profiles is identified, the first set of member profiles satisfying a search query initiated by a searcher. A second set of member profiles is identified, the second set of member profiles having a relationship between a profile of the searcher and each of the second set of member profiles. A subset of the second set of member profiles is selected based on strength of relationships between each of the subset of the second set of member profiles and the first set of member profiles. A portion of each of the first set of member profiles is communicated for presentation in a first region of a search results user interface and a portion of each of the subset of the second set of member profiles is communicated for presentation in a second region of the search results user interface. | 2016-02-04 |
20160034462 | Search Results Based on User Biases on Online Social Networks - In one embodiment, a method includes receiving a query, identifying one or more nodes of a plurality of second nodes corresponding to the query, calculating a score for each of the identified nodes using a probabilistic ranking model that scores each node based at least in part on a number of edges connecting the node to one or more nodes within a first set of user nodes that includes the first node and user nodes corresponding to second users sharing one or more user attributes with the first user, and generating corresponding search results. The score calculated for each of the identified nodes may bias the search results toward nodes connected to disproportionately more nodes in the first set of user nodes than nodes in the plurality of second nodes that correspond to an overall population of users of the online social network. | 2016-02-04 |
20160034463 | Identifying User Biases for Search Results on Online Social Networks - In one embodiment, a method includes receiving a query, determining a user bias of a first user of an online social network from a first node corresponding to the first user and a plurality of user nodes corresponding to a plurality of second users sharing one or more user attributes with the first user, identifying nodes of a plurality of second nodes based at least in part on the user bias of the first user, where the identified nodes correspond to the structured query, and generating search results corresponding to the identified nodes. The bias may be determined by identifying a candidate user node of the second nodes, comparing a first user attribute of the first node to a second user attribute of the candidate user node, and including the candidate user node in the user nodes when the first user attribute matches the second user attribute. | 2016-02-04 |
20160034464 | PERSONALIZED SEARCH BASED ON SEARCHER INTEREST - A system and method for personalized search based on searcher interest may include obtaining a search term from a member of a social network at a user device via the network interface. An initial result may be generated based on the search term, including a first group of content items from a social network and stored in a content database, the content items including member profiles of members of the social network. Each of the content items of the first group may be ranked based on information from an activity database, the activity database storing the information related to the social network, the activities including interactions with search results that include ones of the member profiles. A second group of the content items may be displayed, including at least some of the first group of the content items, based on the rank of the first group of the content items. | 2016-02-04 |
20160034465 | PERSONALIZED SEARCH BASED ON SIMILARITY - A system and method for personalized search based on similarity may include obtaining a search term. An initial result based on the search term and including a first group of content items as stored in a content database may be generated. Each of the content items of the first group may be ranked based, at least in part, on similarity scores, each of the similarity scores individually based on a first member profile relative to individual ones of second member profiles to which an activity related to a content item of the first group corresponds. The user device may display a second group of the content items, including at least some of the first group of the content items, according to the rank of the first group of the content items. | 2016-02-04 |
20160034466 | PERSONALIZED SEARCH USING SEARCHER FEATURES - A system and method for personalized search using searcher features may include obtaining a search term from a member of a social network at a user device via the network interface. An initial result may be generated based on the search term, including a first group of content items from a social network and stored in a content database, the content items including member profiles of members of the social network. Each of the content items of the first group may be ranked based on information indicative of interactions from an activity database with the content items of the first group, the interactions being by at least a second user of the social network different than the first user. A second group of the content items may be displayed, including at least some of the first group of the content items, based on the rank of the first group of the content items. | 2016-02-04 |
20160034467 | METHODS, SYSTEMS, AND COMPUTER PROGRAM PRODUCTS FOR GENERATING DATA QUALITY INDICATORS FOR RELATIONSHIPS IN A DATABASE - The disclosed methods, systems, and computer-program products allow a business to generate data quality indicators for relationships in a database. In an embodiment, one or more relationships linked to a customer are retrieved from a database to form a set of relationships. A match confidence code is generated for each relationship based on a score generated by the comparison of customer data associated with the respective relationship and corresponding customer data obtained from an external industry database. A link confidence code is subsequently determined for the customer based on a score generated by the scores used to define the match confidence code for each relationship in the set of relationships and on internal data associated with each relationship in the set of relationships. | 2016-02-04 |
20160034468 | TESTING OF AND ADAPTING TO USER RESPONSES TO WEB APPLICATIONS - The technology disclosed relates to web analytics and, in particular, to testing user reactions to alternative browser or web application presentations. Some implementations present a selected, ordered set of images. The position and ordering of individual images can be significant to user response. Some implementations adapt a background, motif, or image set based on a requesting user's preferences, such as a color preference. The technology disclosed simplifies test implementation, so that a few lines of code can be added to a web app to invoke the test platform and obtain operational parameters that shape a user's experience. | 2016-02-04 |
20160034469 | ELECTRONIC CONTENT RANKING BASED ON DISTRIBUTION AND CONSUMPTION - Distribution of content items provided by content producer computers to content consumer computers via a computer network is controlled and indications of different interactions with content items contained in messages distributed to content consumer computers are tracked. The different interactions with content items occur at the content consumer computers. Content items are indexed and ranked indications of at least some indexed content items are output in response to search queries. Tracking indications of different interactions with indexed content items occurs at the content consumer computers. Indications of different interactions with content items contained in messages distributed to content consumer computers and with indexed content items outputted in response to search queries are quantified. Content items are ranked based on the indications of different interactions. | 2016-02-04 |
20160034470 | PRIORITIZING DATA CLUSTERS WITH CUSTOMIZABLE SCORING STRATEGIES - Techniques are disclosed for prioritizing a plurality of clusters. Prioritizing clusters may generally include identifying a scoring strategy for prioritizing the plurality of clusters. Each cluster is generated from a seed and stores a collection of data retrieved using the seed. For each cluster, elements of the collection of data stored by the cluster are evaluated according to the scoring strategy and a score is assigned to the cluster based on the evaluation. The clusters may be ranked according to the respective scores assigned to the plurality of clusters. The collection of data stored by each cluster may include financial data evaluated by the scoring strategy for a risk of fraud. The score assigned to each cluster may correspond to an amount at risk. | 2016-02-04 |
20160034471 | ENTITY DETECTION AND EXTRACTION FOR ENTITY CARDS - A system and method are provided for detecting entity information contained within search results. The detected entity information can be used to determine a category of entity as well as a specific entity within the search results. Entity information can be extracted from the documents associated with the search results. This information can be used as part of the information for an entity card, which can be displayed to a user in conjunction with and/or in place of the search results. | 2016-02-04 |
20160034472 | Rules Based Playlist Generation - Multiple playlists can be generated for broadcast or streaming. An aggregate difference representing a difference between the aggregate playout length of the multiple playlists and a target playout length can be determined. If the aggregate difference exceeds a difference threshold, a subsequent time period can be reduced, so a playlist generated for the subsequent time period can be constrained to be shorter. The playlists can be generated based on different versions of a ranking chart or list that includes ranked media items, e.g. songs, videos, etc., by automatically including the highest ranked media items to the playlist, but only adding some of the lower ranked media items to the playlist. Lower-ranked media items can be pseudo-randomly excluded from one or more of the playlists if the media item's rank in a newer version of a ranking chart is lower than its ranking in a previous version. | 2016-02-04 |
20160034473 | INFORMATION COLLECTING SYSTEM - Provided is an information collecting system which can easily handle data without considering entities of the data even when data unknown to the information collecting system is generated or collection of the unknown data is requested. The information collecting system comprises a managing region | 2016-02-04 |
20160034474 | Enterprise Data Mining in a Multi-Tenant Database - An enterprise software system connected to multi-tenant hosted software offered in a cloud computing environment having the capacity to serve a large number of users with a small number of servers, and means for collecting and reporting statistically relevant information based on an aggregation of the data within the multi-tenant database. The integrated software modules include modules for IT management, financial operations, portfolio management, project management, project budget management, resource management, and operations management. The system permits user-specific lexicography mapped to a Master terminology; ranking projects on financial and non-financial indicators; presentation of a dynamic dashboard of proposals and approved projects; provision of a service catalogue that incorporates budget and asset management processes; a multi-tenanted database that enables users to share data management resources while maintaining their data in confidence; and providing aggregate IT data for competitive intelligence purposes. | 2016-02-04 |
20160034475 | AGGREGATING, DISTILLING, AND ANALYZING LARGE-SCALE DATA SETS - Systems and methods for aggregating, distilling, and analyzing large scale data sets are disclosed. In some embodiments a task object is generated based on inputs received via a user interface defining actions to perform on one or more data sets. In some embodiments, the task object is forwarded to a pipeline processor configured to invoke a distributed computing cluster to process the data sets according to the defined actions. | 2016-02-04 |
20160034476 | FILE MANAGEMENT METHOD - A file management method according to the present invention is based upon a file management method of making a file from a client to a storage device redundant by a certain redundant number and storing the file in the storage device, and the file management method according to the present invention includes a first step of accepting an additional file from the client to the storage device, a second step of comparing capacity of the additional file and unused physical capacity of the storage device and a third step of changing the redundant number of the already stored file, increasing the unused physical capacity and storing the additional file in the storage device when the capacity of the additional file is larger than the unused physical capacity. | 2016-02-04 |
20160034477 | VIDEO SEARCH AND PLAYBACK INTERFACE FOR VEHICLE MONITOR - A method and apparatus including a plurality of video recordings in a database containing images of vehicles traveling within a predetermined geographic area and a list of license plates attached to each of those vehicles, a display that depicts a map of the geographic area, an input that receives an identifier of a license plate of a vehicle and a time period and a processor that searches the plurality of video recordings for the license plate, returns a list of cameras capturing images of the identified license plate, sorts the returned list by time of capture and displays a motion path of the vehicle on the map for the time period based upon the returned list. The method and apparatus may also include a uniform interface where the end-user can click the camera mark on the vehicle motion path to play back historic video recorded by this camera individually or click the whole motion path to play back a merged video which combines all recorded video from returned cameras into one display using only one window for the display of the merged video. | 2016-02-04 |
20160034478 | INCREMENTAL INFORMATION INTEGRATION USING A DECLARATIVE FRAMEWORK - Embodiments of the present invention relate to a declarative framework for efficient incremental information integration. In one embodiment, a method of and computer program product for information integration is provided. An integration rule is received. A first data set is accessed. A first representation of the first data set is generated based on the plurality of integration rules. The first representation is flat and includes a plurality of records. At least one index is generated. The index encodes at least one hierarchical relationship among the plurality of records. A second representation is generated of the first representation based on the at least one index. The second representation comprises nested data. | 2016-02-04 |
20160034479 | Disaster Response System - Health care data and other information about a person is stored in memory accessible to a server. The server allows users to access the health care data and information, such as across a communication network. In some embodiments a biometric identifier for a person is stored with the information. The biometric identifier can be used to control access to a person's records and to quickly locate information associated with a particular person during a disaster. | 2016-02-04 |
20160034480 | Determining Labels from Similar Geographic Features - Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, including a method that comprises: determining a target geographic feature that has insufficient targeting information associated therewith, the target geographic feature defining a location; determining one or more similar geographic features to the target geographic feature, each geographic feature including targeting information, the determining based on shared excess queries that are shared between geographic features that are determined to be similar to the target geographic feature, wherein each excess query is a query associated with and exceeds an expected query count for each of the similar geographic features and the target geographic feature; attributing targeting information associated with at least one of the one or more similar geographic features to the target geographic feature; and serving content responsive to queries that relate to the target geographic feature based at least in part on the attributed targeting information. | 2016-02-04 |
20160034481 | EFFICIENT VOLUME-LEVEL REPLICATION OF DATA VIA SNAPSHOTS IN AN INFORMATION MANAGEMENT SYSTEM - The illustrative systems and methods use a special-purpose volume-replicating server(s) to offload client computing devices operating in a production environment. The production environment may remain relatively undisturbed while production data is replicated to a geographically distinct destination. Replication is based in part on hardware-based snapshots generated by a storage array that houses production data. The illustrative volume-replicating server efficiently moves data from snapshots on a source storage array to a destination storage array by transferring only changed blocks for each successive snapshot, i.e., transferring incremental block-level changes. Periodic restore jobs may be executed by destination clients to keep current with their corresponding source production clients. Accordingly, after the source data center goes offline, production data may be speedily restored at the destination data center after experiencing only minimal downtime of production resources. By employing block-level techniques, the disclosed solutions avoid the file-based data management approaches of the prior art, which tend to be too time-consuming and resource-intensive for the present scenario. | 2016-02-04 |
20160034482 | METHOD AND APPARATUS FOR CONFIGURING RELEVANT PARAMETERS OF MAPREDUCE APPLICATIONS - Method and apparatus for configuring relevant parameters of MapReduce applications. The method includes: receiving a request for processing a first MapReduce job; obtaining job feature attributes of historical MapReduce jobs; searching out a job feature attribute of a second MapReduce job associated with the first MapReduce job from within the job feature attributes of the historical MapReduce jobs; configuring relevant parameters of the first MapReduce job based on the job feature attribute of the second MapReduce job. According to this method of configuring parameters, network overheads of file transfer can be effectively reduced, the method of configuring parameters of the MapReduce job can be processed locally as much as possible, thereby effectively increasing system resource utilization while reducing the administrator's configuration burden. An apparatus for configuring relevant parameters of MapReduce applications is also provided. | 2016-02-04 |
20160034483 | METHOD AND SYSTEM FOR DISCOVERING RELATED BOOKS BASED ON BOOK CONTENT - System and method for determining book similarities based on text content and thereby discovering related books for recommending to customer-users. Each book is associated with a probability distribution on a set of topics that is derived from text content of the book against the set of topics. The pair-wise distances of the probability distributions between corresponding books are computed to derive similarities thereof. The probability distributions may be generated by leveraging a text topic model that defines a set of topics, a respective set of relevant terms under each topic, and a probability distribution on each set of relevant terms. The text topic model may be automatically generated by processing content of a corpus of training books via a training process. | 2016-02-04 |
20160034484 | DOCUMENT TAGGING AND RETRIEVAL USING ENTITY SPECIFIERS - Techniques for managing big data include tagging of documents and subsequent retrieval using per-subject dictionaries having entries with some entries specially designated as entities. An entity indicates that the term in the entry has special meaning, e.g., brands (trademarks/service marks), trade names, geographic identifiers or other classes of terms. A dictionary may include a non-entity entry for a term and one or more entity entries, for different entity types. The entries may also include subject-determining-power scores. The subject-determining-power scores provide an indication of the descriptive power of the term with respect to the subject of the dictionary containing the term. The same term may have entries in multiple dictionaries with different subject-determining-power scores in each of the dictionaries. The entity distinctions for a term can then be used in tagging documents and processing retrieval requests. | 2016-02-04 |
20160034485 | SYSTEM AND METHOD TO STORE AND RETRIEVE IDENTIFIER ASSOCIATED INFORMATION CONTENT - In one embodiment, information content files, such as text files, image files, XML files and the like, that provide information related to an identifier bearing item, such as a consumer item with a barcode, are stored on a data storage device such as network server. Identifier data entries, such as UPC data, are associated with file data entries, such as file names, in a database. General file access information that is used in accessing the information content files is stored on a terminal. In operation an identifier is read by the terminal and is used to extract one of the file data entries from the database based on database communication information stored on the terminal. The file data entry in combination with the general file access information is used to establish communication with the data storage device and to extract one of the information content files. | 2016-02-04 |
20160034486 | Multi-Range and Runtime Pruning - A system, apparatus, and method for managing data storage and data access with querying data and filtering value ranges using only a constant amount of computer memory in the implementation of bloom filters based on a first consumption of a relation. | 2016-02-04 |
20160034487 | SELECTIVE FRAGMENTATION REPAIR - Selective repair of fragmentation in a synthetic backup, based at least in part on a dynamically-determined repair criteria, is disclosed. In various embodiments, a locality measure is computed with respect to a group of segments comprising a portion of a file. The computed locality measure is compared to an at least partly dynamically determined fragmentation repair criteria, and a repair decision is made based at least in part on the comparison. | 2016-02-04 |
20160034488 | METHOD AND APPARATUS FOR MODIFYING COMPRESSED FILES - A method, apparatus and computer program product are provided for preparing and installing update packages for compressed files. In the context of a method, a method for preparing an update package is provided that includes receiving an original file and a modified file, causing the original file and the modified file to be decompressed, and generating one or more delta files based on the decompressed original file and the decompressed modified file. A corresponding method for installing an update package is also provided that includes receiving the update package comprising one or more delta files corresponding to an original file, causing the original file to be decompressed, generating one or more modified subfiles based on the one or more delta files and the decompressed original file, and generating a compressed modified file by compressing the one or more modified subfiles. | 2016-02-04 |
20160034489 | SCHEDULING DEDUPLICATION IN A STORAGE SYSTEM - A system can maintain multiple queues for deduplication requests of different priorities. The system can also designate priority of storage units. The scheduling priority of a deduplication request is based on the priority of the storage unit indicated in the deduplication request and a trigger for the deduplication request. | 2016-02-04 |
20160034490 | CONFIGURATION REPLICATION IN A SEARCH HEAD CLUSTER - Replication of search-related configuration customizations across multiple individual configuration files of search heads of a cluster for a consistent user experience. The cluster of search heads may be synchronized such that the search heads operate to receive the configuration or knowledge object customizations from one or more clients. Following synchronization, the respective search heads may display the knowledge object customizations to the one or more clients via a user interface. | 2016-02-04 |
20160034491 | METHODS FOR ACCESSING BIG DATA AND SYSTEMS USING THE SAME - An embodiment of the invention introduces a method for accessing big data, which contains at least the following steps. A data access request is received from one of a plurality of database frontends of different kinds. A data access operation is performed for the data access request by using an API (Application Programming Interface) to manipulate one of a plurality of cloud file systems of different kinds. | 2016-02-04 |
20160034492 | SYSTEMS AND METHODS FOR ON-DEMAND DATA STORAGE - Systems and methods provide on demand data storage by creating an unpartitioned storage pool; generating a unique volume identifier for a data storage volume at a predetermined location; pre-provisioning the data storage volume in a volume queue ready for use on-demand; and storing data on the data storage volume at the predetermined location on-demand. | 2016-02-04 |
20160034493 | Systems and Methods for the Collection Verification and Maintenance of Point of Interest Information - Systems and methods for the collection, verification, and maintenance of point of interest information are provided. One example system includes a plurality of mobile collection devices respectively operated by a plurality of human collectors. Each of the mobile collection devices uploads to one or more intermediate servers information describing one or more attributes of a plurality of points of interest. The system includes a mobile verification device that receives, from the one or more intermediate servers, information uploaded by a first mobile collection device and provides an indication of an accuracy associated with the information received from the first mobile collection device. The system includes the one or more intermediate servers. The system includes one or more production servers. The information uploaded by the first mobile collection device is transcribed from the one or more intermediate servers to the one or more production servers. | 2016-02-04 |
20160034494 | SYSTEMS AND METHODS FOR RANKING POINTS OF INTEREST - Systems and methods are provided for ranking points of interest (POIs). According to one implementation, a computer-implemented method may be provided that includes receiving geographic information from a user device, and selecting a plurality of points of interest within the geographic region. The method may also include generating an initial ranking of the selected points of interest based on voting information received from the user device, viral input from a user's social network, and/or additional input based on searches for maps and travel directions for traversing a route. In addition, the method may include calculating a modified ranking of the selected points of interest. | 2016-02-04 |
20160034495 | MAP DIFFERENCE DATA DISTRIBUTION SYSTEM, MAP DIFFERENCE DATA DISTRIBUTION DEVICE, MAP DATA MAINTAINING DEVICE, UPDATE MANAGEMENT SERVER, AND MAP DIFFERENCE EXTRACTION SERVER - A map difference data distribution system includes a map difference data distribution device including a map difference data storage unit storing map difference data items and a distribution unit distributing map difference data items, and a map data maintaining device including an update unit updating map data based on map difference data items. The map difference data storage unit traces the updated data items and data items having dependent relationships with the updated data items back to the different version updates, and stores the data items as map difference data items. The map difference data distribution device includes a non-updated data extraction unit extracting non-updated data items from the map data maintaining device, a grouping unit grouping the data items in dependent relationships, and a distribution data generation unit generating distribution data by arranging grouped records of non-updated data items from higher priority of map element under specified communication traffic volume. | 2016-02-04 |
20160034496 | System And Method For Accessing Electronic Data Via An Image Search Engine - The present invention provides a system and method for accessing electronic data through entry of images as queries in a search engine. The system uses various image capturing devices and communication devices to capture images and enter them into an image database. Image recognition techniques encode images in a computer readable format. The processed image is then entered for comparison into at least one database populated with images and associated information. Once the newly captured image is matched with an image in the database, the information linked with that image is returned to the user. | 2016-02-04 |
20160034497 | INFORMATION PROCESSING SYSTEM, INFORMATION PROCESSING METHOD, AND INFORMATION PROCESSING APPARATUS - An information processing system includes an information processing apparatus and one or more terminal devices. The information processing system includes: an image transmitting unit configured to transmit an image; an image acquiring unit configured to acquire the image transmitted by the image transmitting unit, and assign unique identification information to the acquired image; a feature information extracting unit configured to extract feature information from the image acquired by the image acquiring unit; a first image storage unit configured to store therein the feature information and the identification information in an associated manner; and a managing unit configured to, when an amount of the feature information stored in the first image storage unit exceeds a threshold, change the first image storage unit into a second image storage unit, generate a new first image storage unit, and prohibit the second image storage unit from storing therein new feature information. | 2016-02-04 |
20160034498 | UNLIMITED RETROACTIVE DATA ELEMENT DIMENSION WIDENING - Systems and methods for retroactively performing dimension widening on a data element are provided. A described method includes receiving a set of data elements from a database and a set of dimension widening rules. The method further includes identifying one or more of the data elements to which any of the dimension widening rules are applicable. The method further includes processing the identified data elements by applying the applicable dimension widening rules to the identified data elements. A dimension widening delta is generated for each data element, and the dimension widening deltas may be stored. The method further includes merging one or more of the data elements with one or more of the dimension widening deltas in response to a query. | 2016-02-04 |
20160034499 | COMPRESSING AND TRANSMITTING STRUCTURED INFORMATION - Various of the present embodiments convert structured data to a channel-based representation facilitating pattern recognition by a compression utility. The channels may include a key channel, a number channel, a type channel, a length channel, etc. These channels reflect the same information as the structured data, but reorganized based upon the structured data's hierarchic structure. The encoder and decoder may include rule sets specifying the relations between the channels to deconstruct/construct the JSON file. Memoization, length encoding, and type encoding extensions may also be applied as part of the encoding/decoding process to further improve compression. | 2016-02-04 |
20160034500 | Normalization Rule Generation and Implementation Systems and Methods - Product records having attributes according to various schema are normalized such that the attributes thereof conform to a canonical schema. Attributes for large numbers of product records are normalized according to a map-reduce framework in which mappers are defined but no reducers. Rules are implemented by a rule engine that is instantiated one time per VM of the map-reduce framework. Likewise, each rule may be implemented by only one object instance per rule engine instance. Generation of rules by analysts may be facilitated by defining an attribute hierarchy. A rule defined for a parent attribute may be presented as a default rule for a child attribute and either accepted or revised. Attributes may be clustered and proposed rules generated. Proposed rules may then be accepted or rejected by an analyst. | 2016-02-04 |
20160034501 | GENERATING A DATABASE STRUCTURE FROM A SCANNED DRAWING - A method of generating a database structure from a scanned drawing of a desired database structure is provided. The method comprises: processing the scanned drawing to identify features of the desired database structure; and implementing the identified features to generate the desired database structure. | 2016-02-04 |
20160034502 | Automatic Detection of Potential Data Quality Problems - Technical solutions for detecting potential data quality problems are provided. In some implementations, a method includes: automatically without human intervention, identifying a subset of side effect data associated with a set of enterprise data. The side effect data include a plurality of data fields. The method further includes: selecting a first set of data quality detection rules in accordance with a first data field in the plurality of data fields; identifying one or more candidate data quality problems in the set of side effect data by comparing the set of side effect data to the first set of data quality detection rules; and responsive to identifying the one or more candidate data quality problems: causing to be displayed to a user: information representing the one or more candidate data quality problems; and one or more candidate solutions for correcting the one or more candidate data quality problems. | 2016-02-04 |
20160034503 | STORING AND RETRIEVING CONTEXT SENSTIVE DATA IN A MANAGEMENT SYSTEM - A management system may include a reconciliation engine configured to reconcile a first instance of a resource object from a first data provider and a second instance of the resource object from a second data provider to obtain a reconciled resource object, and store the first instance, the second instance, and the reconciled resource object in datasets. The management system may include a context sensitive query engine configured to receive a context-sensitive query including context information identifying a source originally providing context sensitive data associated with a context-sensitive attribute, and retrieve the context sensitive data from one or more of the datasets based on the context information. | 2016-02-04 |
20160034504 | EFFICIENT AGGREGATION, STORAGE AND QUERYING OF LARGE VOLUME METRICS - The present system provides for more efficient processing, storage and querying of metrics from a distributed system from which large volumes of metrics are collected. The present metrics processing system may store billions of performance metrics in a persistence storage system, such as an HBase storage system, for several days, with minimum space required and at the same time retaining a low level data granularity. The reporting queries may use a unique technique to find required metrics in the HBase persistence store using a portion of the key as a bit array. The present metrics processing system may use a very small number of keys to store minute level metrics data for a metric for several hours. The metric values may be pivoted to multiple time-bucketed keys at different times during their lifetime in the system. | 2016-02-04 |
20160034505 | SYSTEMS AND METHODS FOR LARGE-SCALE LINK ANALYSIS - Systems and methods for accepting relationship indications based on interaction among entities, where each relationship indication specifies that a respective pair of the entities may be related. A single entity record may be constructed that indicates all the linked entities that have been identified as being related to the entity. The entities may include an individual, a set of individuals, a communication terminal, a plurality of communication terminals, an organization, an e-mail address, a Web-site, a bank account and a home address. An analytics operation may be performed, such as identifying a shortest sequence of interrelated entities that relate a first entity with a second entity, and acting upon the identified sequence. | 2016-02-04 |
20160034506 | METHOD AND SYSTEM FOR OFFLINE INDEXING OF CONTENT AND CLASSIFYING STORED DATA - A method and system for creating an index of content without interfering with the source of the content includes an offline content indexing system that creates an index of content from an offline copy of data. The system may associate additional properties or tags with data that are not part of traditional indexing of content, such as the time the content was last available or user attributes associated with the content. Users can search the created index to locate content that is no longer available or based on the associated attributes. | 2016-02-04 |
20160034507 | READ OPERATIONS IN A TREE-BASED DISTRIBUTED FILE SYSTEM - Techniques for cloning, writing to, and reading from file system metadata. Cloning involves identifying a first set of pointers included in a first root node in a file system metadata tree structure that stores file system metadata in leaf nodes of the tree structure, creating a first copy of the first root node that includes the first set of pointers, creating a second copy of the first root node that includes the first set of pointers, associating the first copy with a first view, and associating the second copy with a second view. Reading generally involves traversing the tree structure towards a target leaf node that contains data to be read. Writing generally involves traversing the tree structure in the same manner, but also creating copies of any nodes to be modified if those nodes are deemed to have a different treeID than a particular root node. | 2016-02-04 |
20160034508 | WRITE OPERATIONS IN A TREE-BASED DISTRIBUTED FILE SYSTEM - Techniques for cloning, writing to, and reading from file system metadata. Cloning involves identifying a first set of pointers included in a first root node in a file system metadata tree structure that stores file system metadata in leaf nodes of the tree structure, creating a first copy of the first root node that includes the first set of pointers, creating a second copy of the first root node that includes the first set of pointers, associating the first copy with a first view, and associating the second copy with a second view. Reading generally involves traversing the tree structure towards a target leaf node that contains data to be read. Writing generally involves traversing the tree structure in the same manner, but also creating copies of any nodes to be modified if those nodes are deemed to have a different treeID than a particular root node. | 2016-02-04 |
20160034509 | 3D ANALYTICS - Herein is disclosed an ontology solution that may incorporate four main stages, including training, ontology administration, ontology tagging, and ontology analytics. The training step involves internal machine learning in which the system learns the customer's specific domain. An initial ontology is passed to the ontology administration step where a user reviews the initial ontology and refines it to create a refined ontology. The refined ontology is then stored and passed to the tagging module. Tagging is a continuous online process that uses the ontology to tag tracked items in incoming interactions, and stores the tagged interactions in a persistent repository. Finally the tagged interactions are then used by the analytics module to analyze and extract business data based on an enhanced formulization of a company's internal knowledge and terminology. | 2016-02-04 |
20160034510 | INDEXING AND SEARCHING LOG RECORDS USING TEMPLATES INDEX AND ATTRIBUTES INDEX - A computer partitions a log stream into records having defined intervals. For each of the records, the computer selects a template from among a set of templates in a templates index repository based on an identifier for the record being associated in the templates index repository with the template, identifies an attribute contained in the record based on the template, and stores the attribute associated with an identifier for the record and an identifier for the template in an attributes index repository. The attributes index repository and the templates index repository are used to retrieve records responsive to search queries. | 2016-02-04 |
20160034511 | INDEXING USER-DEFINED MINUTIAE - In one embodiment, a computing device receives text input from a user. The computing device determines an action by the user indicated in the text, and at least a part of a phrase in the text that indicates one or more topics associated with the action. The computing device determines whether the at least part of a phrase matches one or more phrases stored in an indexed database. When the at least part of a phrase does not match one or more phrases of the indexed database, then the computing device determines a complete phrase input by the user, and determines one or more topics to be associated with the complete phrase. The computing device stores the complete phrase as an instance of the complete phrase. The computing device determines whether at least a threshold number of instances of the complete phrase has been stored. | 2016-02-04 |
20160034512 | CONTEXT-BASED METADATA GENERATION AND AUTOMATIC ANNOTATION OF ELECTRONIC MEDIA IN A COMPUTER NETWORK - Computerized systems for automating content annotation (e.g., tag creation and/or expansion) for low-content items within a computer network by leveraging intelligence of other data sources within a network to generate secondary content (e.g., a “context”) for items (e.g., documents) for use in a tagging process. For example, based on user assigned tags for an item, secondary content information can be generated and used to determine a new list of candidate tags for the item. Additionally, the context of an input item may be compared against the respective contexts of a plurality of other items to determine respective levels of similarity between the input item and each of the plurality of other items in order to annotate the input item. Techniques involving web-distance based clustering and leveraging crowd-sourced information sources to remove noisy data from annotated results are also described. | 2016-02-04 |
20160034513 | METHOD TO FILTER AND GROUP TREE STRUCTURES WHILE RETAINING THEIR RELATIONSHIPS - A method for filtering tree nodes and a method for grouping tree nodes are provided. The filtering method filters a plurality of nodes of at least one source tree according to a criterion. The result of the filtering includes at least one target tree and each target tree is a subset of one of the source trees. The grouping method splits a plurality of nodes of at least one source tree into a plurality of groups according to a criterion. Each group includes at least one target tree and each target tree is a subset of one of the source trees. The hierarchy of at least one source tree is preserved in the target trees after the filtering and/or the grouping. | 2016-02-04 |
20160034514 | PROVIDING SEARCH RESULTS BASED ON AN IDENTIFIED USER INTEREST AND RELEVANCE MATCHING - Computerized systems for providing interest-to-item matching when item metadata is lacking or unavailable such that desired items of interest (e.g., research datasets) may be located for a user. For instance, the computing system may generate a context of a user's interest based on information indicating the user's interest (e.g., authors of research document, title of research document), and use the context to identify potentially relevant items and determine the relevance of the items to the user's interest. Additionally, a searchable database of items is generated by extracting identifiers of low content items from publicly available sources, such as the Internet, and generating contexts for the identified items. The computing system then indexes the identified items in the database using the generated contexts thereby enabling users to search the database for items of interest. Moreover, generating a context for items provides better accessibility for items that have little or no indexable content (e.g., metadata). | 2016-02-04 |
20160034515 | Assessing Risk of Inaccuracies in Address Components of Map Features - To generate address components for a selected map feature, all polygonal map features containing or near the location of a selected map feature are identified. The error bounds of each identified polygon are modeled based on the quality of the boundary of the polygon. Then, the error bounds of the polygon are compared to the location of the selected map feature to determine the strength of the match. The address components corresponding to the identified polygons are suggested to be components of the address of the selected map feature based on the strength of the matches. In another embodiment, a risk of inaccuracy of a combination of address components in an edited map feature is determined from comparison to other map data and can be adjusted based in part on the magnitude of an inconsistency between address components. | 2016-02-04 |
20160034516 | METHOD AND SYSTEM FOR ANALYSIS OF AN OBJECT - Methods, systems, and computer products for analyzing an object to determine contents of the object. The methods, systems, and computer products include a procedure having one or more groups that segment the object to be analyzed. One or more items are assigned to the groups, and each item includes one or more attributes that are associated with the item. Further, a predefined sequence for evaluation of the items is associated with the procedure. Each item is then displayed in accordance with the predefined sequence for evaluation of each attribute associated with the items. To this end, an attribute field associated with each attribute is presented and can be electronically interacted with in order to indicate evaluation of the attribute. | 2016-02-04 |
20160034517 | WEB BASED DATA MANAGEMENT - Approaches are provided for assessing and displaying data. An approach includes determining one or more aggregate measures of data quality for data. The approach further includes assessing an overall data quality for the data based on the determined one or more aggregate measures of data quality. The approach further includes displaying the data, the determined one or more aggregate measures of data quality, and the assessed overall data quality. | 2016-02-04 |
20160034518 | DATA RESEARCH AND RISK MANAGEMENT ASSESSMENT APPLICATION - Systems, apparatus, and computer program products provide for a comprehensive platform in which users can gain access to data mapping and linkage information associated with multiple data sources, data systems, databases within the systems and the like. As such, the platform provides for time-efficient and reliable data management and research which aids the user in comprehending the connections between data from different data sources and included within different data systems, and the downstream impact (i.e., the impact of the data on other data fields) and upstream data source(s) (i.e., the secondary data fields used to calculate the data field) of such data. | 2016-02-04 |
20160034519 | SYSTEM AND METHOD FOR VERIFYING THE CONTENTS OF FORMS RELATIVE TO A SEPARATE DATASET - A method is provided for verifying the contents of forms, comprising: receiving a dataset from a client, the dataset associated with a transaction; transmitting the dataset to a document vendor to be entered into and complete a transaction document form; receiving the completed transaction document form from the document vendor; generating a code uniquely associating the completed document with the dataset; printing the code onto the completed document; transmitting the completed document to the client; after the document has been executed, receiving the executed document from the client and separately the current dataset; using the code on the executed document, retrieving the stored transaction dataset; comparing the stored dataset with the dataset separately delivered as the current dataset; identifying all inconsistencies between the two datasets and storing these results as separate data; and transmitting a message to the client with the result of the comparison. | 2016-02-04 |
20160034520 | Apparatus and Method for Maintaining and Storing a Log of Status Information - A control circuit maintains and stores in a memory a log for a retail enterprise of information comprising the current status of various items including data preparation, aggregation of values, execution of statistical models, and development visualizations for each of a plurality of items that are offered for retail sale within the retail enterprise. By one approach the plurality of items represents only a subset of all items that are offered for retail sale by this retail enterprise. By another approach the plurality of items represents all items that are offered for retail sale by this retail enterprise. If desired, the aforementioned log comprises such information for each of a plurality of hierarchical user levels in the retail enterprise. | 2016-02-04 |
20160034521 | METHOD, DEVICE AND SYSTEM FOR RETRIEVING DATA FROM A VERY LARGE DATA STORE - Systems, methods and devices are provided for deploying data from an operational database with multi-version-concurrency-control, the method comprising: deriving a single SQL query statement for retrieving large amounts of related, heterogeneous data as output where the large amounts of data are internally self-consistent; transforming and decorating the single SQL query output to obtain deployment data; and transferring the deployment data to the deployment target. | 2016-02-04 |
20160034522 | AGGREGATING DATA IN A MEDIATION SYSTEM - Records received from one or more sources in a network are processed. For each of multiple intervals of time, a matching procedure is attempted on sets of one or more records, including comparing identifiers associated with different records to generate the sets and determining whether or not a completeness criterion is satisfied for one or more of the sets. The processing also includes, for at least some of the intervals of time, processing at least one complete set, consisting of one or more of the received records on which the matching procedure is first attempted during the interval of time and one or more records stored in a data store before the interval of time, and for at least some of the intervals of time, processing at least one incomplete set, consisting of one or more records stored in the data store before the interval of time. | 2016-02-04 |
20160034523 | SUB-BLOCK PARTITIONING FOR HASH-BASED DEDUPLICATION - Sub-block partitioning for hash-based deduplication is performed by defining a minimal size and maximum size of the sub-block. If one of a plurality of search criteria is satisfied by one of a plurality of hash values, declaring a position of the hash value as a boundary end position of the sub-block. If the maximum size of the sub-block is reached prior to satisfying one of the multiple search criteria, declaring a position of an alternative one of the hash values that is selected based upon another one of the multiple search criteria as the boundary end position of the sub-block. One of the plurality of search criteria is satisfied if n bits at predefined positions of a value calculated by applying an XOR operation on last calculated k hash values are equal to one of an mth predefined different patterns of bits. | 2016-02-04 |
20160034524 | SYSTEMS AND METHODS FOR ENHANCING USER DATA DERIVED FROM DIGITAL COMMUNICATIONS - A computer-implemented method for enhancing and utilizing user data derived from digital interactions includes receiving a submission generated by input into a client side application interface by a first user on a first computing device, and determining, based on attributes of the submission, that the submission is in response to an issue-specific communication advertising information concerning a first issue, the issue-specific communication indicating a request for a financial transaction, and that the financial transaction related to the issue-specific communication is requested. The method includes generating a first dataset associated with the first user, searching one or more additional datasets for additional data to be associated with data elements of the first dataset, associating the additional data from the one or more additional datasets with the first user, and generating a data model corresponding to the first user. | 2016-02-04 |
20160034525 | GENERATION OF A SEARCH QUERY TO APPROXIMATE REPLICATION OF A CLUSTER OF EVENTS - A processing device performs a preliminary grouping of data items in a dataset to define one or more clusters and for each cluster, identifies a set of search terms for a search query that would retrieve data items in the cluster upon execution of the search query against the dataset. | 2016-02-04 |
20160034526 | ADVANCED SEARCHING OF RELATIONAL DATA VIA A MOBILE DEVICE - A system, method and apparatus for providing for searching of relational data via a mobile communications device are provided. Information identifying searchable tables containing fields is received using a metadata application program interface (API). The metadata API provides information about a schema of the fields and a relationship of at least one of the fields to its corresponding object. Information for generating a view of relational data is provided to the mobile communications device. The view is based on at least one of the fields of at least one of the searchable tables, and the relationship between the at least one of the fields and its corresponding object. A set of user inputs is received based on the relationship presented in the view. A query definition corresponding to the set of user inputs for searching the relational data in accordance with the set of user inputs is created and stored. | 2016-02-04 |
20160034527 | ACCURATE PARTITION SIZING FOR MEMORY EFFICIENT REDUCTION OPERATIONS - Embodiments of the invention relate to processing data records, and for a multi-phase partitioned data reduction. The first phase relates to processing data records and partitioning the records into a first partition of records having a common characteristic and a second partition of records that are not members of the first partition. The data records in each partition are subject to intra-partition data reduction responsive to a resource constraint. The data records in each partition are also subject to an inter-partition data reduction, also referred to as an aggregation to reduce a footprint for storing the records. Partitions and/or individual records are logically aggregated and a data reduction operation for the logical aggregation of records takes place in response to available resources. | 2016-02-04 |
20160034528 | CO-PROCESSOR-BASED ARRAY-ORIENTED DATABASE PROCESSING - A technique includes receiving a user input in an array-oriented database. The user input indicates a database operation. The technique includes processing a plurality of chunks of data stored by the database to perform the operation. The processing includes selectively distributing the processing of the plurality of chunks between a first group of at least one central processing unit and a second group of at least one co-processor. | 2016-02-04 |
20160034529 | SYSTEMS AND METHODS FOR A DISTRIBUTED QUERY EXECUTION ENGINE - Various embodiments of the present disclosure can include systems, methods, and non-transitory computer readable media configured to receive at least one database query to be executed. Code corresponding to the at least one database query can be generated. One or more optimizations to the generated code can be performed to produce specialized modular code. The one or more optimizations can include Just-In-Time (JIT) compilation techniques. Respective portions of the code can be distributed to a plurality of distributed computing systems for execution, wherein each of the distributed computing systems is connected to a portion of the plurality of distributed computing systems. A result for the at least one database query can be provided. | 2016-02-04 |
20160034530 | SYSTEMS AND METHODS FOR A QUERY OPTIMIZATION ENGINE - Various embodiments of the present disclosure can include systems, methods, and non-transitory computer readable media configured to receive at least one database query to be executed. At least one computation graph corresponding to the at least one database query is generated. The computation graph is transformed to an optimized computation graph. The respective portions of the optimized computation graph are distributed to a plurality of distributed computing systems for execution. A result for the at least one database query is provided. | 2016-02-04 |
20160034531 | EFFICIENT JOIN-FILTERS FOR PARALLEL PROCESSING - According to one embodiment of the present invention, a method of transferring filters for operations in a distributed system comprises generating at each source node a first filter of a first size. The first filter indicates elements of objects that need not be processed for an operation. Each source node generates one or more second filters of a second size less than the first size. Each generated second filter is based on a prior generated filter with each location representing a plurality of locations from the prior generated filter. A second filter is transferred from each source node to each destination node. The elements of the objects to remove for the operation are determined based on the transferred second filters. Embodiments of the present invention further include a system and computer program product for transferring filters in a distributed system in substantially the same manners described above. | 2016-02-04 |
20160034532 | FLEXIBLE OPERATORS FOR SEARCH QUERIES - In some embodiments, a rewritten search query is received. The rewritten search query can comprise a first flexible operator being applied to a term, a first condition, a first satisfaction target indicating a minimum amount of search results for the first flexible operator that are to satisfy the first condition, a second condition, and a second satisfaction target indicating a minimum amount of search results for the first flexible operator that are to satisfy the second condition. Search results for the first flexible operator can be determined from a search index. A first set of documents from the search index can be determined to be part of the search results for the first flexible operator based on the first set of documents satisfying the first condition without requiring the first set of documents to satisfy the second condition until a skip number for the second condition has been reached. | 2016-02-04 |
20160034533 | DATABASE MANAGEMENT SYSTEM AND METHOD - A database management system has a query interface for receiving a query and a query executor for executing the received query. The query executor dynamically generates a task, and executes a plurality of tasks in parallel. In execution of each task, each time data is required to be read from a database, the query executor generates a task for acquiring the data, and in execution of the generated task, issues a data read request for reading the data from the database, to thereby shorten a time taken to execute each task. | 2016-02-04 |
20160034534 | TECHNIQUE FOR UPDATING A CONTEXT THAT FACILITATES EVALUATING QUALITATIVE SEARCH TERMS - The disclosed embodiments relate to a system that updates a context that facilitates evaluating qualitative search terms for an attribute during query processing. During operation, the system extracts a value for the attribute from each data item in a set of data items. Next, the system updates the context based on the extracted attribute values, wherein the context includes a concept-mapping for one or more qualitative search terms applied to the attribute, and wherein each concept-mapping associates a given attribute value with a numerical compatibility index that indicates a compatibility between the given attribute value and a corresponding qualitative search term. | 2016-02-04 |
20160034535 | Support Actual and Virtual SQL Dataflow by Streaming Infrastructure - A method includes binding a stationed operator to a database query engine, wherein the stationed operator comprises system utilities and an open executor to execute user-supplied application logic. The method further includes receiving an unbounded input stream of input tuples by the stationed operator, the stream of input tuples comprising a succession of chunks. The method also includes applying a query defined by the application logic to each successive chunk of the input stream to produce an output packet for each such chunk, and generating an unbounded output stream comprising the output packets. | 2016-02-04 |
20160034536 | PROVIDING CONTEXT IN ACTIVITY STREAMS - A computer-implemented method, computer program product, and computing system are provided for providing contextual information associated with a post in an activity stream. In an implementation, a method may include determining a need for contextual information relative to a post within an activity stream. The method may also include searching one or more contextual information sources relative to at least a portion of the post within the activity stream. The method may further include associating contextual information with the post within the activity stream. | 2016-02-04 |
20160034537 | PROVIDING CONTEXT IN ACTIVITY STREAMS - A computer-implemented method, computer program product, and computing system are provided for providing contextual information associated with a post in an activity stream. In an implementation, a method may include determining a need for contextual information relative to a post within an activity stream. The method may also include searching one or more contextual information sources relative to at least a portion of the post within the activity stream. The method may further include associating contextual information with the post within the activity stream. | 2016-02-04 |
20160034538 | IN-MEMORY CLOUD TRIPLE STORE - In an example embodiment, a triple to store in a triple store is received from a first application, the triple defining data having a subject, predicate, and object. The triple is sent to a triple store interface configured to store the triple in an in-memory database management system. Later, a request from a second application to query the triple store is received. A cloud-based triple store query server is used to process the request and generate an in-memory database management system query to the in-memory database management system. Results of the in-memory database management system query are received from the in-memory database management system. The results are sent to the second application for display to a user. | 2016-02-04 |
20160034539 | SYSTEM AND METHOD OF MANAGING METADATA - A server for searching for metadata corresponding to content includes a communicator configured to receive metadata corresponding to content related to at least one device of a user, from the at least one device; and a controller configured to store the received metadata in a remote storage that is separate from the server, the received metadata being searchable, and upon receiving a search request containing a search term for searching for metadata, to search the received metadata stored in the remote storage for metadata related to the search term. | 2016-02-04 |
20160034540 | SYNTHESIS OF WEBPAGE SNIPPETS USING SUB-PAGES OF THE WEBPAGE - A server device is configured to identify sub-pages corresponding to a particular webpage, where the particular webpage and the identified sub-pages may be associated with a same website; determine titles for the identified sub-pages; process the identified sub-pages to select a set of sub-pages, of the identified sub-pages, to use to generate a snippet for the particular webpage, where each webpage, in the set of sub-pages, may satisfy particular criteria, and a quantity of the set of sub-pages may be based on a length of a concatenation of the titles for the set of sub-pages; generate the snippet, for the particular webpage, using the titles for the set of sub-pages; and store the generated snippet in association with the particular webpage, where the generated snippet may be presented in a search result document when the particular webpage is identified as being relevant to a search query. | 2016-02-04 |
20160034541 | Operation and Method for Prediction and Management of the Validity of Subject Reported Data - A system for developing and implementing empirically derived algorithms to generate decision rules to predict invalidity of subject reported data and fraud with research protocols in surveys allows for the identification of complex patterns of variables that detect or predict subject invalidity of subject reported data and fraud with the research protocol in the survey. The present invention may also be used to monitor invalidity of subject reported data within a research protocol to determine preferred actions to be performed. Optionally, the invention may provide a spectrum of invalidity, from minor invalidity needing only corrective feedback, to significant invalidity requiring subject removal from the survey. The algorithms and decision rules can also be domain-specific, such as detecting invalidity or fraud among subjects in a workplace satisfaction survey, or demographically specific, such as taking into account gender or age. The algorithms and decision rules may be optimized for the specific sample of subjects being studied. | 2016-02-04 |
20160034542 | INTEGRATING VARIOUS SEARCH AND RELEVANCE PROVIDERS IN TRANSACTIONAL SEARCH - Search parameters are generated from search tokens that are entered within a transactional search context. A data provider is selected from a plurality of different data providers for executing a search, using the search tokens. Relevance calculators, which calculate the relevance of matching search results, and sorting systems, which sort the search results, can also be selected from a plurality of different relevance calculators and sorting systems. The sorted search results are then displayed. | 2016-02-04 |
20160034543 | GENERATING ITEM LISTINGS ACCORDING TO MAPPED SENSOR DATA - In various example embodiments, a mapping system and method for generating product listings for machine sensed and user specified criteria are presented. In example embodiments, sensor data about an object, and user characteristic information are received. Physical characteristics are extracted from the sensor data and mapped with the user characteristic information and related characteristics to create mapped characteristics. Based on the mapped characteristics, item listings are identified, ranked and presented to the user. The user can subsequently refine the search criteria by adding, subtracting or reweighing the characteristics. | 2016-02-04 |
20160034544 | SYSTEM AND METHOD FOR MULTI-DIMENSIONAL DATA REPRESENTATION OF OBJECTS - Systems and methods for multi-dimensional data representation of an object are provided. The multi-dimensional data representation method includes evaluating a similarity measure for a query corresponding to an object. The similarity measure between the objects is used to compute the similarity values corresponding to the object and based on at least one metadata dimension associated with the object. The similarity values are sorted to create a multi-dimensional array of similarity values. The similarity values are represented in a scalar form and a visualization interface displays a multi-visual representation of the similarity values and data associated with the object. | 2016-02-04 |
20160034545 | SYSTEMS AND USER INTERFACES FOR DYNAMIC AND INTERACTIVE SIMULTANEOUS QUERYING OF MULTIPLE DATA STORES - Embodiments of the present disclosure relate to a computer system and interactive user interfaces configured to enable efficient and rapid access to multiple different data sources simultaneously, and by an unskilled user. The unskilled user may provide simple and intuitive search terms to the system, and the system may thereby automatically query multiple related data sources of different types and present results to the user. Data sources in the system may be efficiently interrelated with one another by way of a mathematical graph in which nodes represent data sources and/or portions of data sources (for example, database tables), and edges represent relationships among the data sources and/or portions of data sources. For example, edges may indicate relationships between particular rows and/or columns of various tables. The table graph enables a compact and memory efficient storage of relationships among various disparate data sources. | 2016-02-04 |
20160034546 | OPTIMIZING LOADING OF RELATIONAL DATA FOR APPLICATIONS - A mechanism for optimizing loading of relational data for applications is disclosed. A method includes monitoring a plurality of requests selecting entities and processes to be executed for the selected entities. The plurality of requests are generated in an application server. The method also includes associating an application context with each of the processes to be executed for the selected entities. The application context includes the process to be executed using an entity stored in a database external to the application server. The method further includes generating a fetch plan for the entity stored in the database, and modifying the generated fetch plan in view of the selected entities and the associated application context. | 2016-02-04 |
20160034547 | SYSTEMS AND METHODS FOR AN SQL-DRIVEN DISTRIBUTED OPERATING SYSTEM - Various embodiments can include systems, methods, and non-transitory computer readable media configured to receive at least one operation to be performed using (i) first data that is managed by a first computing system and (ii) second data that is managed by a second computing system, the operation being received through an interface provided by the computing system, and wherein the operation is based at least in part on a Structured Query Language (SQL). At least one optimization can be performed based at least in part on the operation. The operation can be executed using at least the first data and the second data. A result generated can be provided upon executing the operation through the interface provided by the computing system. The computing system, the first computing system, and the second computing system are each able to concurrently process, access, and create at least a portion of the generated result. | 2016-02-04 |
20160034548 | System and Method for Obtaining Automated Scaling of a Virtual Desktop Environment - A virtual desktop system includes a virtualization host, a backup virtualization host, and a virtual desktop manager. The virtual desktop manager routes traffic of the virtual desktop system to the virtualization host, provides a baseline replication of the virtualization host on the backup virtualization host, provides a first snapshot replication of the virtualization host on the backup virtualization host, determines that the virtualization host has experienced a trigger event, synchronizes the virtualization host with the backup virtualization host in response to determining that the virtualization host has experienced a trigger event, and re-routes traffic of the virtual desktop system to the backup virtualization host in response to synchronizing the virtualization host with the backup virtualization host. | 2016-02-04 |
20160034549 | Hierarchical Chunking of Objects in a Distributed Storage System - Placement of object replicas in a distributed storage system includes, at a first instance, opening a journal for storage of object chunks. An object is received, which comprises one or more chunks. Each chunk comprises one or more storage blocks. The blocks for a single chunk are stored in a single journal. Global metadata for the object is stored, which includes a list of chunks for the object. Local metadata for the chunk is stored, which includes a block list identifying each block of the plurality of blocks. The local metadata is associated with the journal. The journal is later closed. The journal is subsequently replicated to a second instance. The global metadata is updated to reflect the replication, whereas the local metadata is unchanged by the replication. | 2016-02-04 |
20160034550 | System and method for enterprise data management - Systems and methods for managing, integrating and synchronizing data from a number of different sources for an enterprise. A Master Data Management (“MDM”) system may be used to hold and manage the enterprise's data, and a data integration server may be used to facilitate communications between the MDM system and other data sources, e.g., a CRM. The MDM system may store the enterprise's data as customer master data, which may include data from the enterprise's internal sources, data from the MDM provider, data from other data providers, and a Golden Record consolidated from such data. A data steward service may be used to maintain the customer master data and make it accurate and up-to-date. Users may directly search the customer master data in the MDM system, and may also update the CRM with the customer master data. | 2016-02-04 |
20160034551 | DETERMINATION OF DATA PARTITION - Partitioning a set of data using a binary format to more evenly distribute data values across a set of data partitions. By using a system of matching binary indices in a “back to front” manner, data values stored in or associated with data partitions are more evenly balanced. | 2016-02-04 |
20160034552 | PROCESSING DATA IN A DATA WAREHOUSE - Processing data of a data warehouse is provided and includes receiving, by a processing device, user input to create simple filter objects. Each filter object defines an ad hoc subset of a respective dimension of a dimension table of the data warehouse. User input is received to create a filtered operation object that specifies an operation and a plurality of the simple filter objects. The ad hoc subset differs from all subsets defined in the dimension table. | 2016-02-04 |
20160034553 | HYBRID AGGREGATION OF DATA SETS - The disclosed embodiments provide a system for processing data. During operation, the system obtains one or more filters associated with dimensions of a data set. Next, the system calculates a cardinality associated with the one or more filters. The system then adjusts a level of aggregation of a query associated with the one or more filters based on the cardinality and a query result limit for querying of the data set. Finally, the system uses the query to obtain a set of records from the data set. | 2016-02-04 |
20160034554 | LARGE-SCALE DATA CLUSTERING WITH DYNAMIC SOCIAL CONTEXT - A system and method for dynamic, semi-supervised clustering comprises receiving data attributes, generating a set of ensemble partitions using the data attributes, forming a convex hull using the set of ensemble partitions, generating a simplex vector by performing ensemble clustering on the convex hull, receiving dynamic links, deriving an optimal simplex vector using the simplex vector and the dynamic links, computing a current optimal clustering result using the optimal simplex vector, and outputting the current optimal clustering result. | 2016-02-04 |