44th week of 2021 patent applcation highlights part 47 |
Patent application number | Title | Published |
20210342319 | DUPLICATE TABLE IDENTIFICATION IN ENTERPRISE DATABASE SYSTEMS FOR DATA STORAGE OPTIMIZATION - There are provided systems and methods for duplicate table identification in enterprise database systems for data storage optimization. A service provider, such as an electronic transaction processor for digital transactions, may determine data duplication in database tables so that database storage resources may be optimized. In order to determine data duplication, within database tables, a data collector daemon operation and/or application may collect metadata for tables within a domain. Using the metadata, a master table and derived tables may be determined for a group of the tables. Further, a duplication factor may be determined based on matching columns in the tables, a usage factor may be determined using processing hits to the tables, and a size factor may be determine based on table size. This allows for determination of a relevance score of the group, which provides a measure of duplication of data within those tables. | 2021-11-04 |
20210342320 | Method and system for identifying duplicate columns using statistical, semantics and machine learning techniques - With the availability of huge amount of data, it has becoming difficult to identify and manage duplicate data, especially when the data is in a plurality of columns. A method and system for identifying duplicate columns using statistical, semantics and machine learning techniques have been provided. The system provides a design framework to compare huge datasets at column level and identify potential duplicate columns, not based on the column title, but based on all of its values. The disclosure has ability to compare values in multiple columns and identify potential duplicate columns wherein comparison of values is not only for the exact match, but for semantic match, smart match, fuzzy match, and match after UOM conversion etc. using Statistical, semantics and machine learning techniques. | 2021-11-04 |
20210342321 | METHODS AND SYSTEMS FOR DATA TRACEABILITY AND PROVENANCE - Embodiments for determining processing performed on a data element are provided. A file system call associated with a data element stored in a storage system is detected. The file system call is analyzed. Data lineage for the data element is determined based on the analyzing of the file system call. | 2021-11-04 |
20210342322 | DATABASE IMPLEMENTATION FOR DIFFERENT APPLICATION VERSIONS - Techniques are disclosed for handling database queries from multiple different application versions using a single database. A computer system executing a database management system (DBMS) may receive a database query for a database, where the database query is received from a particular version of an application having a plurality of versions. The database query may specify the particular version. The computer system may then identify catalog tables, where the catalog tables include version information columns storing version access information. The computer system may determine that version access information for the database query indicates that one or more database objects corresponding to the database query are accessible to the particular version of the application. Based on the determining, the computer system may access one or more database objects that are responsive to the database query and are accessible to the particular version of the application, which may advantageously reduce or avoid duplication of data stored in the database. | 2021-11-04 |
20210342323 | SYSTEMS AND METHODS FOR COMPRESSING KEYS IN HIERARCHICAL DATA STRUCTURES - The present disclosure describes systems and methods for enhancing the space utilization of storing keys in hierarchical data structures, such as B+Tree leaf blocks, by partially or fully compressing the keys. Partially compressing the keys may include storing certain keys as offsets from a start key. Fully compressing the keys may include setting a “type” value included in the header of the leaf block to a value indicating that the keys are implicitly represented by an ordinal key index, which defines the ordinal arrangement of keys in the leaf block. Partially or fully compressing keys, according to the implementations described herein, reduces the amount of space needed to store keys in a leaf block of a B+Tree. | 2021-11-04 |
20210342324 | DETERMINING ADDITIONS, DELETIONS AND UPDATES TO DATABASE TABLES - A method, system and computer program product for determining changes to database tables. In an embodiment, the method comprises loading a first group of records in a first time period in a database; loading a second group of records in a second time period in the database; selecting one or more of the records in the database; and for each of the selected records, determining whether it was loaded into the database during the first time period or the second time period, and providing the each record with a specified designation based on the time period during which the each record was loaded into the database. In an embodiment, identified times when the records were loaded in the database are used to determine the time period during which the each record was loaded in the database. | 2021-11-04 |
20210342325 | MEMORY MANAGEMENT USING APPROXIMATED COUNT-MIN SKETCH DATA STRUCTURES - Disclosed herein are systems, methods, and software to use approximated counting with count-min sketch data structures. In one implementation, a computer may identify a value in a data object and determine whether a count identified in a count-min sketch data structure and associated with the value exceeds a threshold count. If the count does not exceed the threshold count, the computer may increase the count in a count-min sketch data structure. If the count does exceed the threshold, the computer may apply a probability function to determine whether to increase the count and, in response to the probability function indicating an increase to the count, increasing the count in the count-min sketch data structure. | 2021-11-04 |
20210342326 | PROCESSING SYSTEM FOR AUTOMATED ELECTRONIC RECORD CREATION AND TRANSMISSION - Mediums, apparatus, computer program code, and means may be provided to customize electronic records associated with a distributed communication network via an automated back-end application computer server. According to some embodiments, a relational database management system database and a plurality of database tables linked to the relational database management system database may be accessed. The back-end application computer server may retrieve electronic record layout information and utilize pre-determined logic associated with the plurality of linked tables, including logic that matches multiple pointers within a single pair of linked tables, to automatically insert accessed data into a customized electronic record in accordance with the electronic record layout information. The customized electronic record may then be transmitted to be rendered via an interactive user interface display. | 2021-11-04 |
20210342327 | METHOD, DEVICE, AND COMPUTER PROGRAM PRODUCT FOR INFORMATION PROCESSING - An information processing method can comprise updating, in response to an update request for a first item in a file system, first attribute information associated with the first item in an ordered data set comprising entries corresponding to items in the file system, the entries indicating path information of storage paths of the items and attribute information associated with the items, and entries corresponding to items under the same directory being consecutively organized in the ordered data set; determining a second item associated with the first item, a parent directory of the second item being same as that of the first item; acquiring second attribute information associated with the second item from the ordered data set; and determining, based on at least the first attribute information and the second attribute information, attribute information associated with the parent directory. The management efficiency of the file system can thus be improved. | 2021-11-04 |
20210342328 | MASTER DATA SUBSTITUTION - A method, a system, and a computer program product for execution master data substitution. One or more first data objects in a master data storage are determined for replacement. One or more second data objects are identified for replacing the first data objects for storage in the master data storage. Replacement of the first data objects with the second data objects is performed in accordance with one or more data object requirements. Replacement of the first data objects by the second data objects is executed in accordance with the one or more data object requirements. A resulting replacement data set is generated and stored. | 2021-11-04 |
20210342329 | SYSTEM OR METHOD TO RUN DISTRIBUTED VALIDATION OF WORKFLOWS ACROSS A NETWORK IN A SHARED DISTRIBUTED LEDGER IN MULTI-TENANT CLOUD ENVIRONMENT - A method of distributed smart contract deployment in a multi-tenant environment implemented by a computing device. The method including receiving a smart contract source, converting the smart contract source to a smart contract code for a tenant of the multi-tenant environment, the smart contract code to manage blockchain data transaction validation, and installing the smart contract code at the tenant to enforce logic of the smart contract source at the tenant in the multi-tenant environment. | 2021-11-04 |
20210342330 | LEDGER DATA GENERATION AND STORAGE FOR TRUSTED RECALL OF PROFESSIONAL PROFILES - A system for ledger data includes a block repository, a metadata database, and a processor. The block repository stores verified secure ledger data in one or more blocks that are cryptographically linked. The metadata database stores metadata information for the one or more blocks in the block repository. The processor is configured to receive a request to create a block content and to determine the block content for a block to be stored in the block repository and metadata content to be stored in the metadata database associated with the block to be stored in the block repository, wherein the metadata content includes one or more flags and source information; and broadcast the block to be stored in the block repository to be approved by participating nodes in a network. | 2021-11-04 |
20210342331 | CUSTOM METADATA TAG INHERITANCE BASED ON A FILESYSTEM DIRECTORY TREE OR OBJECT STORAGE BUCKET - A method and/or system of managing metadata are disclosed that include connecting a source data storage system (DSS) that stores both data and metadata to a metadata management platform (MMP); scanning metadata records onto the MMP from the DSS; storing metadata attributes for at least one of the group consisting of directories and buckets on the DSS in a look-up table on the MMP; and adding updated metadata attributes to the look-up table on the MMP for each subsequent scan of the DSS. | 2021-11-04 |
20210342332 | PRIVATE SHARED RESOURCE CONFIRMATIONS ON BLOCKCHAIN - A processor may identify one or more transaction verification requests from one or more entities. The processor may convert each of the one or more transaction verification requests into respective hashed transaction verification requests. The processor may send, on one or more private, anonymous channels, the hashed transaction verifications to an orchestrator. The processor my decrypt the hashed transaction verifications with the orchestrator. The processor may determine whether information in each of the one or more transaction verification requests matches. | 2021-11-04 |
20210342333 | PARTIAL UPDATES IN DATA COLLECTIONS IN A DATA STORAGE SYSTEM - A method and/or system of partially updating data in a data collection including determining, in a first incoming row of an incoming update file, whether a first incoming column contains new data, and in response using the new data in a first updated column of a first updated row in an updated master file; and in response to the first incoming column of the incoming update file not containing new data, determining whether a first master column in a first master row of the master table contains old data; and in response using the old data in the first updated column of the first updated row in the updated master file. | 2021-11-04 |
20210342334 | METHOD, DEVICE, AND COMPUTER PROGRAM PRODUCT FOR DATA MANAGEMENT - A method, a device, and a computer program product for data management are provided in embodiments of the present disclosure. A method for data management comprises: acquiring, from a storage system, a torrent file associated with downloading of a target file, the target file being stored in the storage system, and the torrent file comprising at least a plurality of data digests of a plurality of data blocks of the target file; generating metadata for the plurality of data blocks based on the torrent file, the metadata being in a format supported by a unified management system, and the unified management system being configured for data access across the storage system and at least one other storage system; and storing the metadata for data-block-level access to the plurality of data blocks through the unified management system. | 2021-11-04 |
20210342335 | IDENTIFYING RECURRING SEQUENCES OF USER INTERACTIONS WITH AN APPLICATION - Disclosed are database systems, computing devices, methods, and computer program products for identifying recurring sequences of user interactions with an application. In some implementations, a server of a database system provides a user interface of the application for display at a computing device. The database system stores data objects identifying a first plurality of user interactions with the application. The server receives information representing a second plurality of user interactions with the application. The server updates the database system to further identify the second user interactions. The server identifies a recurring sequence of user interactions from the first and second user interactions as resulting in a first target state of the application. The server updates the database system to associate the recurring sequence of user interactions with the first target state of the application. | 2021-11-04 |
20210342336 | Data Filtering Utilizing Constructed Graph Structure - A global filter allows data filtering using attributes across multiple Analysis Tools (ATs), by broadcasting semantic filter context objects. Upon selecting object attribute values, the filter context object is created with attribute names and values. A processing engine resolves the filter context object to a data object, and then subsequently to target data. A lateral filter finds related entities in a relational database, without having to maintain and/or duplicate all of the data into a graph database. The processing engine resolves lateral filters using an entity graph path calculation conducted in conjunction with the generation of a bootstrapped graph structure. That graph structure is constructed (bootstrapped) utilizing available database schematic information—e.g., pre-calculated (key) relations and metadata read from the relational database. From that information, relationships in the bootstrapped graph structure are determined. Possible paths between entities are used to generate an optimized SQL query to reach target data. | 2021-11-04 |
20210342337 | DYNAMICALLY MONITORING AN INFORMATION TECHNOLOGY NETWORKED ENTITY - Data intake and query system (DIQS) instances supporting applications including lower-tier, focused, work group oriented applications, are tailored to display the metrics for the needs of the user. An interface caused by operation of an entity monitoring system (EMS) operating in conjunction with the lower-tier DIQS displays the monitored entities as individual representations. The user selects a metric and a metric threshold. The EMS causes a display of an interface having a representation for each monitored entity. Each representation includes a metric value and indicates an entity status based on the metric value and the threshold. The user can dynamically change the threshold on the interface for easy visualization of aggregation of monitored entities to determine the performance of the infrastructure. The interface also provides the user with the ability to select an entity and click through to the entity analysis workspace for more detailed information. | 2021-11-04 |
20210342338 | SYSTEMS AND METHODS FOR RETRIEVING RELEVANT INFORMATION CONTENT WHILE TYPING - Methods, systems, and apparatus, including computer programs stored on computer-readable media, for retrieving relevant information content while typing are disclosed. Upon identifying text being entered by a user into a text field of a typing-enabled application, one or more keywords may be determined that match the text. The matching keywords may be presented for selection. Upon receiving a selection of a keyword, an information card related to the selected keyword may be retrieved and presented. In addition to presenting the information card, information content associated with the information card may be inserted into the text field of the typing-enabled application. | 2021-11-04 |
20210342339 | Method for Defining and Computing Analytic Features - A system, method, and computer-readable medium are disclosed for constructing a distribution of interrelated event features. In various embodiments constructing the distribution includes: receiving a stream of events, the stream of events comprising a plurality of events; generating a query relating to the plurality of events, the query comprising condition information, the condition information defining a subset of query relevant events; processing the query relating to the plurality of events, extracting features from the plurality of events based upon the query; constructing a distribution of the features from the plurality of events based upon the query; and, analyzing the distribution of the features from the plurality of events based upon the query. | 2021-11-04 |
20210342340 | SYSTEMS AND METHODS FOR GENERATING SEARCH QUERIES USING TOGGLE BUTTONS ASSOCIATED WITH PRODUCT CATEGORIES - In some instances, the disclosure provides a method for generating search queries using toggle buttons associated with product categories. The method comprises causing display of a plurality of search recommendation indicia on a graphical user interface. The search recommendation indicia includes toggle buttons associated with product categories and product indicia associated with products. The method further comprises receiving user inputs for a toggle button associated with a product category and product indicia indicating selections of products, determining whether the selected products belong to the product category, generating a search query based on the product category, the selected products, and whether the selected products belongs to the product category associated with the toggle button. Then, the method comprises providing the search query to a backend server and receiving results from the search query from the backend server. | 2021-11-04 |
20210342341 | DATA ANALYSIS ASSISTANCE DEVICE, DATA ANALYSIS ASSISTANCE METHOD, AND DATA ANALYSIS ASSISTANCE PROGRAM - An analysis process receiving unit | 2021-11-04 |
20210342342 | SYSTEMS AND METHODS FOR SECURE FILE MANAGEMENT VIA AN AGGREGATION OF CLOUD STORAGE SERVICES - The present disclosure describes systems and methods for aggregation and management of cloud storage among a plurality of providers via file fragmenting to provide increased reliability and security. In one implementation, fragments or blocks may be distributed among a plurality of cloud storage providers, such that no provider retains a complete copy of a file. Accordingly, even if an individual service is compromised, a malicious actor cannot access the data. In another implementation, fragments may be duplicated and distributed to multiple providers, such that loss of communications to any one provider does not result in inability to access the data. This implementation may be combined with error correction techniques to allow recovery, even with loss of multiple providers. File synchronization may also be faster in these implementations by dividing reading and writing operations among multiple providers. | 2021-11-04 |
20210342343 | DATA HARVESTER - A data harvester enhances compliance audits by characterizing data sources, sampling data in one or more of the data sources to determine likelihood of success of the data harvest, estimating time for the data harvest, making recommendations from the samples based on machine learning relating to previous runs, then sampling additional data while estimated expected completion time. The harvested data may then be analyzed and compared to compliance requirements, and a compliance report may be generated. | 2021-11-04 |
20210342344 | Weighed Order Decision Making with Visual Representation - A system for the dynamic analysis of unstructured data where feedback loops exist between the user and the machine resulting in improved specificity and content (accuracy and precision) with regard to the results obtained from the machine learning algorithms. A Graphic User Interface (GUI) controls the configuration and deployment of all the features of the Intelligence Augmentation System (IAS) including data capture and processing, analytics, and feedback. Results of one set of algorithms can be forwarded to subsequent tools with the system for further analysis and planning using decision algorithms. The results are configured using a GUI that can manipulate the data in dynamically, allowing immediate visualization of user queries. | 2021-11-04 |
20210342345 | LATENT NETWORK SUMMARIZATION - Embodiments of the present invention provide systems, methods, and computer storage media for latent summarization of a graph. Structural features can be captured from feature vectors associated with each node of the graph by applying base functions on the feature vectors and iteratively applying relational operators to successive feature matrices to derive deeper inductive relational functions that capture higher-order structural information in different subgraphs of increasing size (node separations). Heterogeneity can be summarized by performing capturing features in appropriate subgraphs (e.g., node-centric neighborhoods associated with each node type, edge direction, and/or edge type). Binning and/or dimensionality reduction can be applied to the resulting feature matrices. The resulting set of relational functions and multi-level feature matrices can form a latent summary that can be used to perform a variety of graph-based tasks, including node classification, node clustering, link prediction, entity resolution, anomaly and event detection, and inductive learning tasks. | 2021-11-04 |
20210342346 | HIERARCHICAL DATA SEARCHING USING TENSOR SEARCHING, FUZZY SEARCHING, AND BAYESIAN NETWORKS - Methods and systems for performing a search over hierarchical data are provided. The method may be performed by a server comprising a processor and memory. The method includes receiving a query string from a user device. The query string is searched for via a tensor search of a graph structure to determine node tensor distance score for each node in the graph structure. The query string is searched for via a fuzzy search of the graph structure to determine node fuzzy distance score for each node in the graph structure. Nodes with relevant scores are determined by updating a Bayesian network representation with evidence based on the node tensor distance scores and the node fuzzy distance scores of each node. Relevant data from the nodes is sent with relevant scores to the user device. | 2021-11-04 |
20210342347 | FACILITATING TEMPORAL DATA MANAGEMENT FOR ANOMALOUS STATE DETECTION IN DATA CENTERS - Facilitating temporal data management for anomalous state detection in data centers is provided herein. A system can comprise a processor and a memory that stores executable instructions that, when executed by the processor, facilitate performance of operations. The operations can comprise performing a process of extraction, transformation, and loading of data from log files into a telemetry data store. The data can be loaded into the telemetry data store as telemetry data. The operations also can comprise dividing the telemetry data into first telemetry data and second telemetry data. The first telemetry data can comprise telemetry data that does not satisfy a defined quality level. The second telemetry data can comprise telemetry data that satisfies the defined quality level. Further, the operations can comprise removing the first telemetry data from the telemetry data store and outputting the second telemetry data based on a request for the second telemetry data. | 2021-11-04 |
20210342348 | METHOD FOR PARSING QUERY, ELECTRONIC DEVICE AND READABLE STORAGE MEDIUM - A method for parsing a query, an electronic device and a readable storage medium, which relate to the field of natural language processing technologies, are disclosed. According to some embodiments, the method includes: acquiring the query input by a user; constructing a syntactic dependency tree of the query; matching the syntactic dependency tree of the query with syntactic dependency trees of preset templates, and determining a target template according to the matching result; and marking a slot operator of a slot in the query using the target template, and the marked slot operator represents a logical relationship applied to the slot in the query. With the technical solution, the logical relationship applied to the slot in the query may be acquired, and the accuracy of query parse may be improved. | 2021-11-04 |
20210342349 | METHOD AND APPARATUS FOR DETERMINING AND PRESENTING ANSWERS TO CONTENT-RELATED QUESTIONS - Systems and methods for determining the answers to user queries regarding content, and presenting options for the user to view the answers. Searches are conducted to determine the answer to a query and, once the answer is found, various options are presented to the user, allowing the user the choice to view the answer in multiple different ways. | 2021-11-04 |
20210342350 | SYSTEM AND METHOD FOR REDUCING RESOURCE USAGE IN A DATA RETRIEVAL PROCESS - In certain embodiments, resource usage in a data retrieval process may be reduced. In some embodiments, a graph query related to a data request may be obtained. The graph query may be transformed into a query set based on a graph data model and patterns of the graph query. Upon generation, the query set may include queries and query operators linking the queries, where the query operators include a first query operator linking first and second queries of the queries or other query operators. Prior to execution of the first and second queries, a satisfiability issue may be predicted, where the satisfiability issue is related to combining results derived from the first and second queries. Based on the prediction, the first query operator may be removed from the query set to update the query set. The updated query set may be executed to satisfy the graph query. | 2021-11-04 |
20210342351 | DETERMINING QUERY JOIN ORDERS - A technique includes identifying a plurality of substitute candidates for a given multiple join of tables. Each substitute candidate includes a candidate multiple join of tables, and the number of tables of each candidate multiple join is less than the number of tables of the given multiple join. A given substitute candidate is selected based on a cardinality of the candidate multiple join of the given substitute candidate; a join order for the given multiple join is determined based on the given substitute candidate; and the query is processed based on the determined join order. | 2021-11-04 |
20210342352 | METHOD FOR DUPLICATE DETERMINATION IN A GRAPH - Embodiments of the present invention determines duplicates in a graph. The graph comprises nodes representing entities and edges representing relationships between the entities. The method comprises: identifying at least two nodes in the graph. A neighborhood subgraph may be determined for each of the two nodes. The neighborhood subgraph includes the respective node. The method further comprises determining whether the two nodes are duplicates with respect to each other, based on a result of a comparison between the two subgraphs. | 2021-11-04 |
20210342353 | ADAPTIVE FIELD-LEVEL MATCHING - Adaptive field-level matching is described. A system identifies first elements in a field of a prospective record for a database, and second elements in the field of a candidate record, in the database, for matching the prospective record. The system identifies features corresponding to any of the first elements that are identical to any of the second elements, any of the first elements that are absent from the second elements, and any of the second elements that are absent from the first elements. A machine-learning model uses the features to determine a field match score for the candidate record's field. Another machine-learning model weighs the field match score and weighs another field match score for another field of the candidate record to determine a record match score for the candidate record. If the record match score satisfies a threshold, the system identifies the candidate record as matching the prospective record. | 2021-11-04 |
20210342354 | STREAM BROWSER FOR DATA STREAMS - The described technology is generally directed towards a stream browser that provides a human-readable (e.g., hierarchical) view of event data for a data stream of a data streaming storage service. A stream browser associated with a data stream maintains routing key and event data for the data stream, e.g., in a primary index. The routing keys can correspond to container structures, e.g., folders and subfolders, with the events associated with each routing key corresponding to hierarchical children of the routing key parent container structures (e.g., like files). A secondary index can be used to evict the oldest events from the primary index to limit the number of data stream events returnable by the stream browser to a practical number. | 2021-11-04 |
20210342355 | METHOD FOR DISTRIBUTING MEDIA - A method of distributing content to consumers may include providing a server loaded with a computer code having an administrator user interface, a franchisor user interface, a franchisee user interface; a library of content accessible by an administrator, a franchisor, and/or one or more franchisees; and a consumer data bank. The method may further include linking the server with social media platforms, populating the library with a plurality of pieces of content, linking the content with a franchise unit location, and publishing the content to a social media account. A further step may include collecting consumer data and storing the consumer data in the consumer data bank. The franchisor and each franchisee may have access to the library of content and to consumer data. | 2021-11-04 |
20210342356 | MALICIOUS ACTIVITY DETECTION SYSTEM CAPABLE OF EFFICIENTLY PROCESSING DATA ACCESSED FROM DATABASES AND GENERATING ALERTS FOR DISPLAY IN INTERACTIVE USER INTERFACES - Various systems and methods are provided that retrieve raw data from issuers, reorganize the raw data, analyze the reorganized data to determine whether the risky or malicious activity is occurring, and generate alerts to notify users of possible malicious activity. For example, the raw data is included in a plurality of tables. The system joins one or more tables to reorganize the data using several filtering techniques to reduce the processor load required to perform the join operation. Once the data is reorganized, the system executes one or more rules to analyze the reorganized data. Each rule is associated with a malicious activity. If any of the rules indicate that malicious activity is occurring, the system generates an alert for display to a user in an interactive user interface. | 2021-11-04 |
20210342357 | COMMAND-LINE SCRIPT GENERATION WITH RELEVANCE SEARCH - A term-weighting and document-scoring function is used to search for a command line interface (CLI) script that is likely relevant to an operation specified in a natural language query. CLI scripts are created to perform various operations of a CLI-based application. A CLI script is associated with a description document having keywords associated with the individual commands used in the CLI script. The relevance of a CLI script to an intended operation is based on the term-weighting and document-scoring function which is applied to each component of each command in a CLI script and weighted accordingly. | 2021-11-04 |
20210342358 | INTELLIGENT DATASTORE DETERMINATION FOR MICROSERVICE - A method comprises dividing a plurality of operations of a microservice between a plurality of databases, and synchronizing data corresponding to the plurality of operations between the plurality of databases. The microservice is a create, read, update, delete (CRUD) microservice, and the plurality of operations comprise creating, reading, updating and deleting the data. | 2021-11-04 |
20210342359 | TECHNIQUES FOR ACCESSING ON-PREMISE DATA SOURCES FROM PUBLIC CLOUD FOR DESIGNING DATA PROCESSING PIPELINES - A system for simulating a dataset based on sample statistics, and generating pipeline instructions for a database using the simulated dataset, is disclosed. The system may be a cloud-based platform. The system can provide improved performance and security while designing and deploying Extract-Transform-Load (ETL) database pipelines to manage on-premises data. The system receives, from a remote client, sample statistics characterizing a distribution of data corresponding to a column of a database table. The system can generate, based on the received sample statistics, a simulated dataset representing the column of the database table. The simulated dataset may be generated using pseudo-random data values from a distribution with statistics matching the sample statistics. The system can build a simulated database relation based at least in part on the simulated dataset. The system can generate and deploy pipeline instructions using the simulated database relation. | 2021-11-04 |
20210342360 | VIRTUAL WAREHOUSE POOLS FOR EXECUTING TASKS - The subject technology determines a number of execution nodes to execute a task. The subject technology determines that no virtual warehouse from a pool of virtual warehouses includes at least the number of execution nodes. The subject technology generates a new virtual warehouse including at least the number of execution nodes. The subject technology determines whether a previous execution of a prior task was completed in less than a threshold time period of a time interval, the time interval corresponding to a particular period of time for executing at least one task. The subject technology increments a score corresponding to a size of a particular virtual warehouse. The subject technology selects the new virtual warehouse from the pool of virtual warehouses based at least in part on the incremented score and the number of execution nodes included in the selected new virtual warehouse. | 2021-11-04 |
20210342361 | DIGITAL PROCESSING SYSTEMS AND METHODS FOR TWO-WAY SYNCING WITH THIRD PARTY APPLICATIONS IN COLLABORATIVE WORK SYSTEMS - Systems, methods, and computer-readable media for syncing data between a tabular platform and a third-party application are disclosed. The systems and methods may involve accessing a first platform that displays a first set of data in a first format; accessing a second platform that displays a second set of data in a second format; linking the first set of data with the second set of data to enable migration of the first set of data to the second platform and the second set of data to the first platform; enabling the first platform to simultaneously display the second set of data in the second format; enabling alteration of the second set of data in the second platform through manipulation of the simultaneous display of the second set of data in the first platform; and syncing the second set of data as altered with the first data set. | 2021-11-04 |
20210342362 | System and Method for Prioritizing Replication Copy Activity - A method, computer program product, and computer system for triggering, by a computing device, a synchronization activity on a storage object. A request may be executed for a copy operation resulting from the synchronization activity to copy the storage object from a source to a destination when a current number of concurrent copy operations is below a maximum number of copy operations. The request for the copy operation resulting from the synchronization activity to copy the storage object from the source to the destination may be placed in a queue when the current number of concurrent copy operations is at least at the maximum number of copy operations. One or more attributes of the request in the queue may be identified. The request for the copy operation may be executed according to a replication policy based upon, at least in part, the one or more attributes of the request in the queue. | 2021-11-04 |
20210342363 | BLOCK FORMATION DEVICE AND BLOCK FORMATION METHOD, NODE DEVICE AND BLOCK CONFIRMATION METHOD - A block formation device and to a node device for a distributed database system, each having a unit for receiving a timing clock pulse from a time source and determining time slices of prescribed length on the basis of the timing clock pulse is provided. The block formation device is configured to select transactions to be confirmed precisely once within a respective time slice from unconfirmed transactions provided in the database system, to form an unconfirmed block from the selected unconfirmed transactions and to provide the unconfirmed block in the database system. The node device is configured to store a chain of confirmed blocks representing a transaction log of the database system; and, within a respective time slice, to confirm precisely one from unconfirmed blocks provided in the database system in the time slice precisely once and to add it to the chain of confirmed blocks. | 2021-11-04 |
20210342364 | DIGITAL DECOUPLING - This document described digital decoupling architectures that enable existing computing systems to run in parallel with new computing technologies. In some aspects, a method includes receiving, by a digital decoupling system and from a source computing system, one or more updated data sets that each include data that has been updated at the source computing system. A source data entry of a source table of a database of the digital decoupling system is updated based on each updated data set. In response to detecting the change to the source table, a target data entry that includes data of the updated source data entry is added to a target table of the database. An adapter module obtains the data of the target data entry and generates an event that specifies the data of the target data entry. The event is sent to one or more destination computing elements. | 2021-11-04 |
20210342365 | RESOURCE PROVISIONING SYSTEMS AND METHODS - A method and apparatus managing a set of processors for a set of queries is described. In an exemplary embodiment, a device receives a set of queries for a data warehouse, the set of queries including one or more queries to be processed by the data warehouse. The device further provisions a set of processors from a first plurality of processors, where the set of processors to process the set of queries, and a set of storage resources to store data for the set of queries. In addition, the device monitors a utilization of the set of processors as the set of processors processes the set of queries. The device additionally updates a number of the processors in the set of processors provisioned based on the utilization/Furthermore, the device processes the set of queries using the updated set of processors. | 2021-11-04 |
20210342366 | EMBEDDED CONTAINER-BASED CONTROL PLANE FOR CLUSTERED ENVIRONMENT - Techniques for providing an embedded container-based control plane for clustered environments. A clustered storage system includes one to many storage appliances, in which one storage appliance is designated as a “primary appliance” and any other storage appliance(s) is/are designated as a “peer appliance(s).” The primary appliance includes a central database, one peer appliance includes a synchronously replicated database, and any other peer appliances each include an asynchronously replicated database. The primary appliance further includes a global management IP for managing access to the central database. The primary and peer appliances each further include a control plane container for implementing command processing services. The control plane container of the primary appliance further implements command orchestration services. The clustered storage system with central and replicated databases and a distributed control plane avoids the need to re-synchronize data at the time of storage appliance failure and improves scalability. | 2021-11-04 |
20210342367 | METHODS AND SYSTEMS FOR MULTI-DIMENSIONAL AGGREGATION USING COMPOSITION - Multi-dimensional aggregation using user interface workflow composition is described. Data for a computer implemented process is in a set of related data objects in a data store with each object in the set of related data objects representing an entity modelled in the process. A number of levels for a multi-dimensional aggregation associated with a request is determined where each level of the multi-dimensional aggregation represents a different dimension of data values to be aggregated. Data is aggregated at the levels of aggregation based on the relationships between parent objects and children objects. The data for a final level of aggregation is output to a user interface. The final result includes multiple dimensions of data. | 2021-11-04 |
20210342368 | SYSTEMS AND METHODS FOR PROBABILISTIC DATA CLASSIFICATION - A system for performing data classification operations. In one embodiment, the system comprises a file system configured to store a plurality of computer files and a scanning agent configured to traverse the file system and compile data regarding the attributes and content of the plurality of computer files. The system also comprises an index configured to store the data regarding attributes and content of the plurality of computer files and a file classifier configured to analyze the data regarding the attributes and content of the plurality of computer files and to classify the plurality of computer files into one or more categories based on the data regarding the attributes and content of the plurality of computer files. Results of the file classification operations can be used to set appropriate security permissions on files which include sensitive information or to control the way that a file is backed up or the schedule according to which it is archived. | 2021-11-04 |
20210342369 | METHOD AND SYSTEM FOR IMPLEMENTING EFFICIENT CLASSIFICATION AND EXPLORATION OF DATA - Disclosed is a system, method, and computer program product for analyzing sets of data in an efficient manner, such that analytics can be effectively performed over that data. Classification operations can be performed to generate groups of similar log records. This permits classification of the log records in a cohesive and informative manner. | 2021-11-04 |
20210342370 | DYNAMIC PREFERENCE-BASED MATCHING - A method and system for preference-based matching can receive input data, preference data, and campaign data. Input data can be received from a first interface, and preference data can be associated with the input data. The input data can be ranked based on the preference data. The campaign criteria can be received from a second interface. The input data can be matched to the campaign criteria to create a match list, which can be sorted based on, for example, the preference data. | 2021-11-04 |
20210342371 | Method and Apparatus for Processing Knowledge Graph - The disclosure discloses a method and apparatus for processing knowledge graph. The method includes that: multiple groups of entity data and multiple candidate relationship templates are acquired from a text to be analyzed, the candidate relationship template being configured to describe a relationship between multiple pieces of entity data in a group of entity data; for each group of entity data, the number of times for which the candidate relationship template matched with the group of entity data in the text to be analyzed is matched successfully is determined; a probability of correct matching between each group of entity data and each candidate relationship template is determined according to the number of times for which each group of entity data is matched successfully with each candidate relationship template; and an entity data relationship in a knowledge graph is supplemented according to the probability of correct matching between each group of entity data and the candidate relationship template. | 2021-11-04 |
20210342372 | SYSTEMS AND METHODS FOR PROVIDING GEODATA SIMILARITY - A system may be configured to conflate vectorized source data. Some embodiments may: obtain first data from a first source and second data from a second source; determine a first polygon that encloses all features of the first data and a second polygon that encloses all features of the second data; determine a larger polygon that encloses the first and second polygons; divide the larger polygon into first tiles; extract, from each of the first tiles overlaying the first data and from the each tile overlaying the second data, a first set of features and a second set of features, respectively; and identify, based on a computed disagreement level satisfying a set of criteria, each of one or more of the tiles. A set of identified tiles or all of the tiles may then be displayed, including shaded indicators overlaying features of respective portions of the first and second data. | 2021-11-04 |
20210342373 | Location Data Integration and Management - A system and associated methodology manages localization data. According to one embodiment of the present invention a primary set of data associated with one or more specific locations is imported and matched to a predefined format. Thereafter external, secondary, data associated with each of the localities listed in the primary data set is collected from a plurality of third party location service providers. With the collected secondary data matched to the same predefined format a comparison is made between the plurality of secondary data sets and the primary data set. Differences between the data sets are identified and the primary data set modified as necessary. Thereafter, normalized data from the modified primary data set is exported to the third party location service providers to enhance consistency and reliable of locational data. | 2021-11-04 |
20210342374 | METHOD OF AND SYSTEM FOR DEFINING A SERVICE FOR SERVING MAP IMAGES - A system for and method of generating an API or an API template for use with a map service. The API, e.g., URL when invoked in a WMS server or WMTS server, causes generating a metadata for a set of one or more areas of interest (AOIs) and one or more date ranges. The metadata provides access to a dataset of surveys via getmap or gettile requests. The API is dynamic in that the effect of the API (the metadata generated) is automatically updated whenever there has been a new survey added in the set of AOIs and date ranges. The API template is for a TMS service, and when populated, provides for causing any tiled map in the set of AOIs and date ranges to be displayed. | 2021-11-04 |
20210342375 | SYSTEM AND METHOD FOR AUTOMATIC SUGGESTION AND OR CORRECTING OF SEARCH KEYWORDS - Systems and methods that may be used to automatically correct, complete and or suggest words or terms to the user of an electronic service (e.g., accounting service) while the user is entering a search keyword and or filling out a form field. The automatic correction, completion and or suggestion of words or terms are based natural language processing of historical data from a plurality of users of the electronic service. | 2021-11-04 |
20210342376 | METHOD AND APPARATUS FOR INFORMATION QUERY AND STORAGE MEDIUM - The present application discloses a method and an apparatus for information query, and an electronic device, which relates to a field of deep learning (DL), natural language processing (NLP) and artificial intelligence (AI) technology. The method includes: receiving a query sentence, segmenting the query sentence to obtain word segments, and obtaining a dependency relationship between two word segments and part of speech of the word segments; obtaining a coding sequence of the query sentence according to the dependency relationship and the part of speech of the word segments; matching the coding sequence with a generalized template to obtain a core corpus of the query sentence, wherein the generalized template comprises part of speech to be extracted and a dependency relationship to be extracted; and obtaining a query result corresponding to the query sentence based on the core corpus. The application no longer relies on the accumulation of massive business scenario data to enhance a generalization ability, which ensures accurate and efficient information query, and improves the efficiency and reliability of the information query process. At the same time, it may support information query in different business scenarios, with strong expansion capability and high universality. | 2021-11-04 |
20210342377 | ABSTRACTIVE MULTI-DOCUMENT SUMMARIZATION THROUGH SELF-SUPERVISION AND CONTROL - A method for generating enriched training data for a multi-source transformer neural network for generation of a summary of one or more passages of input text comprises creating, from a plurality of input text sets, training points each comprising an input text subset of the input text set and a corresponding reference input text from the input text set, wherein the size of the input text subset is a predetermined number. Control codes are selected based on reference features corresponding to categorical labels of reference texts in the created training points. The input text is enriched with the selected control codes to generate enriched training data. | 2021-11-04 |
20210342378 | EXTERNAL STORAGE OF UNSTRUCTURED DATABASE OBJECTS - Systems and methods to migrate unstructured objects such as, but not limited to, attachments and platform change documents from a database to a cloud-provided external object storage. The unstructured objects may be designated for migration based on their creation date and/or other characteristics. Migration of an object may include conversion of the object to a serialized file in an object notation format. The database may maintain a header of the migrated object indicating the location to which the object was migrated. | 2021-11-04 |
20210342379 | METHOD AND DEVICE FOR PROCESSING SENTENCE, AND STORAGE MEDIUM - The disclosure discloses a method and a device for processing a sentence, and a storage medium. The detailed implementation includes: during processing a sentence, obtaining a dependency tree graph among respective segmented words in the sequence of segmented words by performing a dependency parsing on a sequence of segmented words of the sentence, inputting the dependency tree graph and a word vector corresponding to each segmented word into a preset graph neural network to obtain an intermediate word vector of each segmented word in the sequence of segmented words, and obtaining a processing result of the sentence by performing the downstream task on the intermediate word vector of each segmented word. | 2021-11-04 |
20210342380 | GENERATIVE ONTOLOGY LEARNING AND NATURAL LANGUAGE PROCESSING WITH PREDICTIVE LANGUAGE MODELS - An ontology topic is selected and a pretrained predictive language model is primed to create a predictive primed model based on one or more ontological rules corresponding to the selected ontology topic. Using the predictive primed model, natural language text is generated based on the ontology topic and guidance of a prediction steering component. The predictive primed model is guided in selecting text that is predicted to be appropriate for the ontology topic and the generated natural language text. The generated natural language text is processed to generate extracted ontology rules and the extracted ontology rules are compared to one or more rules of an ontology rule database that correspond to the ontology topic. A check is performed to determine if a performance of the ontology extractor is acceptable. | 2021-11-04 |
20210342381 | SYSTEM AND METHOD FOR AUTOMATICALLY GENERATING CONCEPTS RELATED TO A TARGET CONCEPT - A method for generating a set of concepts related to a target concept includes accessing a set of candidate concepts, embedding the target concept and the set of candidate concepts in a semantic vector space, selecting one or more intermediate concepts from the set of candidate concepts in response to determining whether each embedded candidate concept in the set of embedded candidate concepts satisfies a predetermined relationship with the embedded target concept, and filtering the one or more intermediate concepts to yield the set of concepts related to the target concept. The method may further include generating a multiple-choice question in which the target concept corresponds to a correct answer choice and the set of concepts related to the target concept correspond to distractors. | 2021-11-04 |
20210342382 | GROUP PROFILE GENERATION AND CONTENT ITEM SELECTION BASED UPON GROUP PROFILES - One or more computing devices, systems, and/or methods are provided. A first plurality of sets of device information associated with a plurality of devices may be received and/or identified. The plurality of devices may be grouped into a plurality of groups based upon the first plurality of sets of device information. A plurality of group profiles associated with the plurality of groups may be generated. A first group profile of the plurality of group profiles may be associated with a first group of the plurality of groups. The first group profile may be generated based upon sets of device information, of the first plurality of sets of device information, associated with the first group. A request for content associated with a first device belonging to the first group may be received. A first content item may be selected for presentation via the first device based upon the first group profile. | 2021-11-04 |
20210342383 | DYNAMIC FEEDBACK IN A RECOMMENDATION SYSTEM - A media recommendation system may score media items according to user recommendations, popularity, and/or recency. The scores may be weighted to produce an overall score for each media item. Media items may be added to a pool for a specific user, from which media items are selected for playback. The contents of the pool may be modified based upon user feedback and other data. The pool may be modified dynamically and/or in real time as media items are consumed or rated by the user. | 2021-11-04 |
20210342384 | VIDEO GENERATION SYSTEM TO RENDER FRAMES ON DEMAND USING A FLEET OF SERVERS - Content controller system comprising rendering server system comprising a plurality of servers. The servers receiving a plurality of segment render requests that correspond respectively to segments included in a set of media content item segments. The servers render the segments corresponding to the segment render requests using a media content identification and a main user identification. Rendering the segments comprises retrieving metadata from a metadata database associated with the media content identification, rendering the segments using the metadata, generating a main user avatar based on the main user identification, and incorporating the main user avatar into the segments. The servers can upload the segments to a segment database and update segment states in a segment state database to indicate that the segments are available. Other embodiments are disclosed herein. | 2021-11-04 |
20210342385 | INTERACTIVE METHOD AND SYSTEM OF BULLET SCREEN EASTER EGGS - The embodiment of the present application provides an interactive method of bullet screen easter eggs, including: obtaining information sent by a client when playing a multimedia file, wherein the information comprises comment content and comment time; determining a target interface element for replacing the comment content; determining a timestamp of the target interface element according to the comment time; generating bullet screen data comprising the target interface element and the timestamp; and sending the bullet screen data to the client for displaying the target interface element in a form of a bullet screen on a playback interface of the multimedia file when the multimedia file is played to a time point that matches the timestamp of the target interface element. | 2021-11-04 |
20210342386 | SIMILARITY SEARCH ENGINE FOR A DIGITAL VISUAL OBJECT - The present invention provides a similarity search engine for a digital visual object (i.e., a digital image that represents a design, graphics, logo, symbols, words, or any combination thereof). The similarity search engine is based on a method that consists of conducting several independent search queries, thus each query examining a different aspect of similarity. | 2021-11-04 |
20210342387 | METHOD AND APPARATUS FOR IMAGE PROCESSING AND ASSOCIATED USER INTERACTION - Aspects of the disclosed technology relate to an apparatus including a memory and at least one processor. The at least one processor can obtain at least one image of a scene and determine a portion of interest within the scene based on a first input. The first input can include a non-touch input. The at least one processor can output, in response to the first input, content associated with the portion of interest and receive a second input from the user. The second input can include a non-eye gaze input and be associated with the content. An action can be initiated by the one or more processor based on the second input. | 2021-11-04 |
20210342388 | IDENTIFYING EQUIPMENT ASSEMBLY INFORMATION BASED ON IMAGE DATA - A method executable by at least one processor includes receiving image data representative of an industrial equipment assembly, identifying properties associated with the industrial equipment assembly based on the image data, identifying a set of industrial equipment assemblies associated with the industrial equipment assembly based on the properties associated with the industrial equipment assembly and data stored in a database, and categorizing the set of industrial equipment assemblies based on the data associated with the industrial equipment assemblies. The method also includes generating an inquiry based on the categorization of the set of industrial equipment assemblies, presenting the inquiry via an electronic display, receiving information responsive to the inquiry and associated with the industrial equipment assembly, identifying a subset of industrial equipment assemblies based on the information, and presenting a visualization associated with the subset of industrial equipment assemblies via the electronic display. | 2021-11-04 |
20210342389 | TECHNIQUES FOR IDENTIFYING COLOR PROFILES FOR TEXTUAL QUERIES - The disclosed techniques include at least one computer-implemented method performed by a system. The system can receive a textual query and process query features of the textual query to identify a color profile indicative of a color intent of the query. The system can identify candidate images that at least partially match the desired content and color intent of the query. The system can further order candidate images based in part on a similarity of a candidate color profile for each candidate image with the identified color profile of the query, and output image data indicative of the ordered set of candidate images. | 2021-11-04 |
20210342390 | IMAGE SEARCH SYSTEM, IMAGE SEARCH METHOD, AND PROGRAM - An image search system ( | 2021-11-04 |
20210342391 | SYSTEM AND METHOD FOR THE MIGRATION OF LEGACY DIGITAL CONTENT - A system and method enabling an upgraded digital recorder to detect, retrieve, decrypt, encrypt and securely store digital content retrieved from one or more legacy digital recorders. The legacy digital recorder is placed into a mode permitting access to stored digital content. An upgraded digital recorder then interfaces with the legacy digital recorder and mutual authentication is performed. Following successful authentication, the upgraded digital recorder disables most of the legacy digital recorder's functionality. Utilizing a digital key protecting the stored content from unauthorized access, the upgraded digital recorder retrieves and decrypts the digital content from the legacy digital recorder. The decrypted content is then analyzed and if needed converted to a format compatible with the upgraded digital recorder. The digital content is then indexed, encrypted and stored in the upgraded digital recorder. This process can be enabled via head-end provisioning and therefore requires little or no consumer or technician participation. | 2021-11-04 |
20210342392 | PERSONALIZED CONTENT SHARING - A method for providing personalized content sharing is disclosed. The method includes: presenting, by a user device of a user of the content sharing platform, a user interface (UI) including content and a UI share element allowing the user to share the content with other users; in response to a selection of the UI share element by the user in the UI, transmitting, to a server, a request to share the content with other users; receiving, from the server, a server-selected subset of contacts of the user on the content sharing platform, wherein contacts in the subset of contacts are selected from a plurality of contacts of the user on the content sharing platform based on at least one of (i) an affinity of the user with each contact of the subset of contacts on the content sharing platform, or (ii) interactions of the user with content of each contact of the subset of contacts; presenting, to the user, the subset of contacts of the user comprising a first contact of the plurality of contacts of the user and not including a second contact of the plurality of contacts of the user; and allowing the user to share the content with the first contact of the plurality of contacts. | 2021-11-04 |
20210342393 | ARTIFICIAL INTELLIGENCE FOR CONTENT DISCOVERY - The present disclosure relates to a video content discovery apparatus, system, method and computer program. In one aspect of the disclosure there is provided a video content discovery module configured to receive a content query, retrieve, from one or more information sources, text that relates to the content query, process the retrieved text, at least in part using Natural Language Processing, to transform the content query to a set comprising one or more video content descriptors, and identify one or more video segments of a plurality of available video segments, using the one or more video content descriptors. | 2021-11-04 |
20210342394 | DEFINING A GRAPHICAL VISUALIZATION ALONG A TIME-BASED GRAPH LANE USING KEY PERFORMANCE INDICATORS DERIVED FROM MACHINE DATA - A service monitoring system receives receiving, via a user interface, an identification of a service of an information technology environment, and causes display of a plurality of key performance indicators (KPIs) in the user interface. Each KPI of the plurality of KPIs indicates a measure of performance for the service. The service monitoring system receives, via the user interface, an identification of a time period, and an identification of one or more visual characteristics for KPI graph lanes. Each of the KPI graph lanes is indicative of one or more KPI values of a respective KPI of the plurality of KPIs, the one or more KPI values are obtained from execution of a search query associated with the respective KPI, and the search query uses the time period to obtain the one or more KPI values. The service monitoring system causes display of a plurality of KPI graph lanes based on the one or more visual characteristics. Each graph lane provides a graphical visualization of the one or more KPI values of the respective KPI, and the display of the plurality of KPI graph lanes allows for visual correlation among the plurality of KPIs over the time period. | 2021-11-04 |
20210342395 | STORAGE EDGE CONTROLLER WITH A METADATA COMPUTATIONAL ENGINE - Embodiments described herein provide improved methods and systems for generating metadata for media objects at a computational engine (such as an artificial intelligence engine) within the storage edge controller, and for storing and using such metadata, in data processing systems. | 2021-11-04 |
20210342396 | RETRIEVAL SENTENCE UTILIZATION DEVICE AND RETRIEVAL SENTENCE UTILIZATION METHOD - To enable a user to easily recognize temporal order of elements included in a retrieval sentence, a retrieval sentence utilization device | 2021-11-04 |
20210342397 | METHOD FOR WEIGHTING A GRAPH - The present disclosure relates to a method for a weighting graph comprising nodes representing entities and edges representing relationships between entities in accordance with one or more domains. The method comprises: pre-processing the graph comprising assigning weights to the nodes and/or the edges of the graph in accordance with a specific domain of the domains, wherein the weight indicates a domain specific data quality problem of attribute values representing an edge of the edges and/or an entity involved in that edge. The weighted graph may be provided for enabling a processing of the graph in accordance with the specific domain. | 2021-11-04 |
20210342398 | METHOD AND SYSTEM FOR ABSTRACTING INFORMATION FOR USE IN LINK ANALYSIS - Observable data points are collected and organized into a link-oriented data set comprising nodes and links. Information is abstracted for use in link analysis by generating links between the collected data points, including deriving links and inducing links. A link can be induced by linking together a pair of nodes that satisfy a distance function. Exemplary distance functions that can be used to induce links include geospatial proximity, attribute nearness, and name similarity. Paths can be identified between selected nodes of interest through a dataset operation, and nodes and/or links can be selectively included or excluded from the data set operation. The dataset can be augmented with pedigree information or one or more association nodes. Link information, including a trajectory and a connected path that selectively produces or excludes one or more intermediate nodes, can be displayed and/or produced in a specified format. | 2021-11-04 |
20210342399 | Neural network-based semantic information retrieval - A question and answer (Q&A) system is enhanced to support natural language queries into any document format regardless of where the underlying documents are stored. The Q&A system may be implemented “as-a-service,” e.g., a network-accessible information retrieval platform. Preferably, the techniques herein enable a user to quickly and reliably locate a document, page, chart, or data point that he or she is looking for across many different datasets. This provides for a unified view of all of the user's (or, more generally, an enterprise's) information assets (such as Adobe® PDFs, Microsoft® Word documents, Microsoft Excel spreadsheets, Microsoft PowerPoint presentations, Google Docs, scanned materials, etc.), and to be able to deeply search all of these sources for the right document, page, sheet, chart, or even answer to a question. | 2021-11-04 |
20210342400 | INFORMATION SEARCH METHOD AND APPARATUS - An information search method and apparatus, an electronic device, a computer-readable storage medium, and a computer program product are provided. The information search method includes receiving a request message including query information from a client; determining a suggested word result and a prefetched search result corresponding to the query information based on the request message; returning the suggested word result as a first response message for the request message to the client; and returning the prefetched search result as a second response message for the request message to the client. | 2021-11-04 |
20210342401 | Method, System, and Computer Program Product for Communication with an Intelligent Industrial Assistant and Industrial Machine - Provided is a method for communication with an intelligent industrial assistant and industrial machine. The method may include receiving a first natural language input from a user. The first natural language input may be associated with a first command for an industrial machine to perform a first process. The industrial machine may be instructed to perform the first process based on the first natural language input. A second natural language input may be received from the user while the industrial machine is performing the first process. A first response may be determined based on the second natural language input. Communication of the first response to the user may be initiated while the industrial machine is performing the first process. A system and computer program product are also disclosed. | 2021-11-04 |
20210342402 | GENERATION OF REGULAR EXPRESSIONS - A method is provided for generating regular expressions. In some embodiments, the method includes partitioning a first plurality of text strings into a plurality of substrings, each text string of the first plurality being malignant or benign. The method also includes generating a population list including a second plurality of text strings, each text string of the second plurality including at least one token. The method further includes creating subsequent iterations of the population list. Each iteration may include calculating a score for one or more text strings in the population list and removing one or more text strings from the population list. The method further includes in response to a determination to not update the population list, selecting, based on the one or more calculated scores, a third text string as a regular expression from a final iteration of the population list. | 2021-11-04 |
20210342403 | SYSTEM AND PROCESS FOR THE VERIFICATION OF DATA - The present invention relates to an apparatus and method for verification of data contained in an unstructured data source, such as a supplier invoice. It is very difficult to obtain information from an unstructured data source, such as an invoice, using processing technology, without error. The apparatus and method are arranged to locate a predetermined data element in the invoice, using an optical character recognition application. The predetermined data element may be a company number, such as an ABN. The ABN can then be used to access databases to obtain further information relating to the supplier. For example, account details. The further information can be cross referenced against the unstructured data to confirm that the further information, such as account number is present in the unstructured data. | 2021-11-04 |
20210342404 | SYSTEM AND METHOD FOR INDEXING ELECTRONIC DISCOVERY DATA - Systems and methods for efficiently processing electronically stored information (ESI) are described. The systems and methods describe processing ESI in preparation for, or association with, litigation. The invention preserves the contextual relationships among documents when processing and indexing data, allowing for increased precision and recall during data analytics. | 2021-11-04 |
20210342405 | DYNAMIC SEARCH SET CREATION IN A SEARCH ENGINE - Disclosed are various embodiments for dynamically generating a search set for use in a search application, such as a search engine, and using the search set to navigate collections of archived content. A request for user interface data is received from a client device in response to the client device attempting to access a domain. The domain may be parsed to identify a search category corresponding to one of the categories stored in a data store. Subcategories and search collections that are associated with the search category are identified and user interface data may be generated that cause at least a portion of the subcategories and search collections associated with a search component to be rendered in a display of the client device. The search component is configured to use a selected one of the at least a portion of the subcategories to narrow a search conducted using the search component by generating a narrowing search query. | 2021-11-04 |
20210342406 | DATA UPDATING METHOD, APPARATUS, SEARCH SERVER, TERMINAL AND STORAGE MEDIUM - Embodiments of present application provide a data updating method, an apparatus, a search server, a terminal and a storage medium. A search server provides a terminal with a search result webpage including one or more webpages associated with a search keyword, where the search result webpage also includes a pre-stored webpage summary obtained by the search server by crawling webpage content of each webpage in advance; then the terminal crawls a target webpage triggered by a user to obtain a current webpage summary, and the terminal sends comparison information of consistency comparison between the current webpage summary and the pre-stored webpage summary to the search server; and finally the search server updates the pre-stored webpage summary of the target webpage according to the comparison information of the target webpage. | 2021-11-04 |
20210342407 | DATA SHARING TOOL FOR FACILITATING REAL-TIME ACCESS TO CURRENT OR UPDATED DATASETS - In some embodiments, a data server generates an open data reference specifying a location for storing report data associated with a report. To generate the report data, the data server generates a data query based on a report definition that describes how the report data associated with the report is generated. The data server executes the data query on source data to obtain the report data and causes the report data to be stored on a storage device at the location specified by the open data reference. The report data can be retrieved via a network using the open data reference for integration within a third-party application. The data server further generates updated report data by executing, on the source data, an updated data query generated based on the report definition and has the report data stored on the storage device to be replaced with updated report data. | 2021-11-04 |
20210342408 | COMPUTER-AIDED METHODS AND SYSTEMS FOR DISTRIBUTED COGNITION OF DIGITAL CONTENT COMPRISED OF KNOWLEDGE OBJECTS - Methods and systems for distributed cognition of digital content include receiving submissions from community members regarding a knowledge object. Each community member has a reputation value and each submission includes an evaluation value representing an evaluation of the knowledge object by the community member. A consensus evaluation is determined based on a calculated combination of the evaluation values in the submissions received and the reputation values of the respective community members who submitted the submissions. While submissions are being received, the consensus evaluation of the knowledge object is iteratively updated based on submissions received, being a calculated combination of the evaluation values in the submissions received and the reputation values of the respective community members who submitted the submissions. Additionally, the reputation value for each community member who submitted the submissions is iteratively updated based on a determined contribution of the respective community member's submission to the updated consensus evaluation. | 2021-11-04 |
20210342409 | Data Filtering Utilizing Broadcast Context Object - A global filter allows data filtering using attributes across multiple Analysis Tools (ATs), by broadcasting semantic filter context objects. Upon selecting object attribute values, the filter context object is created with attribute names and values. A processing engine resolves the filter context object to a data object, and then subsequently to target data. A lateral filter finds related entities in a relational database, without having to maintain and/or duplicate all of the data into a graph database. The processing engine resolves lateral filters using an entity graph path calculation conducted in conjunction with the generation of a bootstrapped graph structure. That graph structure is constructed (bootstrapped) utilizing available database schematic information—e.g., pre-calculated (key) relations and metadata read from the relational database. From that information, relationships in the bootstrapped graph structure are determined. Possible paths between entities are used to generate an optimized SQL query to reach target data. | 2021-11-04 |
20210342410 | Content Suggestions for Content Digests for Assistant Systems - In one embodiment, a method includes, by one or more computing systems, receiving, from a client system via an assistant xbot, a first audio input from a user, parsing the first audio input to identify a request for a content digest from an online social network, retrieving a plurality of content objects from the online social network, each of the content objects being accessible by the user, generating a customized newsfeed summary based on a determined semantical meaning of each of the plurality of content objects, and sending, to the client system via the assistant xbot, instructions for presenting an audio clip to the user responsive to the request for the content digest, wherein the audio clip comprises the customized newsfeed summary. | 2021-11-04 |
20210342411 | EVENT STREAM PROCESSING - An example system receives a plurality of event streams. A separate stream rule is applied to each individual event stream to produce a filtered output event stream. The system also applies a correlation rule to the filtered output event streams to produce correlated event results. | 2021-11-04 |
20210342412 | SYSTEMS AND METHODS FOR INITIATING PERSONALIZED MEDIA - A system and method for initiating personalized media. The system includes a provider tag affixed to a provider object including a unique provider identifier. The system further includes a network interface device communicatively coupled to a communications network configured to support computing devices with retrieving information associated with the provider tag and generating personalized media presentations associated with the provider object configured to be presented on visual displays connected to the communications network. | 2021-11-04 |
20210342413 | IDENTIFYING CODE DEPENDENCIES IN WEB APPLICATIONS - A method may include receiving, from a browser, a request for a tracking pixel including a uniform resource locator (URL) of the tracking pixel and a session identifier (ID) of the browser. The URL of the tracking pixel may be included in a CSS selector. The method may further include sending, to the browser and as triggered by the request for the tracking pixel, a request to scan a document object model (DOM) of the browser. The request to scan the DOM may include the session ID. The method may further include receiving, from the browser, scan results including a set of dependencies on the CSS selector | 2021-11-04 |
20210342414 | ASYMMETRIC INFORMATION FLOW READING METHOD AND APPARATUS, TERMINAL AND STORAGE MEDIUM - The method asymmetric information flow reading mainly includes: a width and a height of the original information flow are preprocessed, the preprocessed information flow is stacked in a highly adaptive equal-width manner, information flow to be displayed after being preprocessed is allocated into a first information flow site and a second information flow site that are configured in advance respectively for concurrent display, detailed information of a required information element is displayed in the first information flow site according to a reading request of a user in any information flow site, and the first information flow is loaded into the second information flow site. With the adoption of the asymmetric information flow reading method and the asymmetric information flow reading apparatus, an inside page may be opened in a home page in an embedded manner by designing a dual information flow mode without changing an original information flow style. | 2021-11-04 |
20210342415 | DATA PROCESSING USER INTERFACE MONITORING SYSTEMS AND RELATED METHODS - In various embodiments, a system may be configured to analyze data for a particular consent capture point to identify a change in consent capture rate from the capture point. The system may, for example, be configured to automatically detect that the system has stopped receiving consent records from a particular capture point. In such embodiments, the system may be configured to generate an alert, and transmit the alert to any suitable individual (e.g., privacy team member, IT department member, etc.) regarding the capture point. The system may, for example, enable an entity to identify one or more capture points that may have become non-functional (e.g., as a result of one or more changes to the capture point). | 2021-11-04 |
20210342416 | TRANSFORMING COLLECTIONS OF CURATED WEB DATA - In non-limiting examples of the present disclosure, systems, methods and devices for surfacing collected web content are presented. A collection of web content may be maintained, wherein the collection of web content is divided into a plurality of sections, each of the plurality of sections comprising a subset of web content from a different webpage. An indication to export the collection of web content to a productivity application may be received. A plurality of attributes that each of the plurality of sections have a value for may be identified. A productivity application document may be populated with the plurality of attributes and the corresponding values from each of the sections. | 2021-11-04 |
20210342417 | ACCELERATOR FOR SPARSE-DENSE MATRIX MULTIPLICATION - Disclosed embodiments relate to an accelerator for sparse-dense matrix instructions. In one example, a processor to execute a sparse-dense matrix multiplication instruction, includes fetch circuitry to fetch the sparse-dense matrix multiplication instruction having fields to specify an opcode, a dense output matrix, a dense source matrix, and a sparse source matrix having a sparsity of non-zero elements, the sparsity being less than one, decode circuitry to decode the fetched sparse-dense matrix multiplication instruction, execution circuitry to execute the decoded sparse-dense matrix multiplication instruction to, for each non-zero element at row M and column K of the specified sparse source matrix generate a product of the non-zero element and each corresponding dense element at row K and column N of the specified dense source matrix, and generate an accumulated sum of each generated product and a previous value of a corresponding output element at row M and column N of the specified dense output matrix. | 2021-11-04 |
20210342418 | SYSTEMS AND METHODS FOR PROCESSING DATA TO IDENTIFY RELATIONAL CLUSTERS - Embodiments of the present disclosure relate to systems and methods that may be employed for processing data to identify relational clusters, the method including receiving, using at least one processor, prior event data, the prior event data comprising a plurality of fields, the plurality of fields corresponding to a plurality of columns and a plurality of rows; determining, using the at least one processor, column field value correlations between the plurality of fields in the plurality of columns; and determining, using the at least one processor, a first column of the plurality of columns with a column field value correlation beyond a predetermined threshold with a second column of the plurality of columns. | 2021-11-04 |