Patent application number | Description | Published |
20120057630 | LOW COMPLEXITY TRANSFORM CODING USING ADAPTIVE DCT/DST FOR INTRA-PREDICTION - A method and apparatus encode and decode video by determining whether to use a discrete cosine transform (DCT) or a discrete sine transform (DST) for each of the horizontal and vertical transforms. During encoding, intra-prediction is performed based on an intra-prediction mode determined for an M×N input image block to obtain an M×N intra-prediction residue matrix (E). Each of a horizontal transform and a vertical transform is then performed using one of DCT and DST according to the intra-prediction mode. During decoding, the intra-prediction mode is determined from an incoming video bitstream. The M×N transformed coefficient matrix of the error residue is obtained from the video bitstream using an inverse quantizer. Based on the intra-prediction mode, one of DCT and DST is performed for each of an inverse vertical transform and an inverse horizontal transform. | 03-08-2012 |
20120082391 | LOW COMPLEXITY SECONDARY TRANSFORM FOR IMAGE AND VIDEO COMPRESSION - A method for encoding video or images includes receiving input data associated with a block within a video or image frame and performing a transform of the input data to produce a first set of output coefficients. The method also includes receiving the first set of output coefficients and performing a second transform to produce a second set of output coefficients. The method further includes quantizing the second set of output coefficients. The second transform is performed using a rotational transform matrix that is selected to maximize a degree of orthogonality of the rotational transform matrix. | 04-05-2012 |
20120183078 | FILTER ADAPTATION WITH DIRECTIONAL FEATURES FOR VIDEO/IMAGE CODING - A method for processing video information by a video encoder includes classifying video information based on at least one local directional feature of the video information to design multiple filters. The encoder encodes filter coefficients of the multiple filters. The multiple filters are designed with symmetric constraints on the filter coefficients configured according to the classification of the video information based on at least one local directional feature of the video information. The method also includes applying, by the encoder, the multiple directional filters to the video information. A decoder is configured to decode filter coefficients of multiple filters and apply the multiple filters to the video information. The decoder constructs, from decoded filter coefficients, multiple filters with symmetric constraints on the filter coefficients configured according to the classification of the video information based on at least one local directional feature of the video information. | 07-19-2012 |
20120320972 | APPARATUS AND METHOD FOR LOW-COMPLEXITY OPTIMAL TRANSFORM SELECTION - A video processing system includes prediction primary transforms, quantization, entropy coding and filtering configured to receive and compress video information and output compressed video information corresponding to the received video information. The compressed video information comprises a prediction mode, a transform block size, a quantization parameter, and a filtering type. The video processing system also includes a secondary transform configured to receive and compress the compressed video information. The video processing system also includes a quantization stage configured to receive and compress the transformed coefficients. The video processing system also includes an entropy coding stage configured to convert the compressed video information into binary bits. The video processing system also includes a filtering stage configured to improve the reconstructed video information for better prediction. | 12-20-2012 |
20120321205 | SHAPE AND SYMMETRY DESIGN FOR FILTERS IN VIDEO/IMAGE CODING - An apparatus includes a video/image encoder configured to design a two-dimensional star-shaped spatial filter and encode image/video information using the star-shaped spatial filter. The star-shaped spatial filter includes a first linear arrangement of coefficients that extend outwardly in a first diagonal direction from a center pixel coefficient, a second linear arrangement of coefficients that extend outwardly in a second diagonal direction, and two linear arrangements of coefficients that extend outwardly in horizontal and vertical directions from the center pixel coefficient. The second diagonal direction is oriented in a different direction relative to the first diagonal direction. | 12-20-2012 |
20130003856 | MODE-DEPENDENT TRANSFORMS FOR RESIDUAL CODING WITH LOW LATENCY - An apparatus and method for processing video data are provided. The method includes determining a primary transform C | 01-03-2013 |
20140016863 | APPARATUS AND METHOD FOR PERFORMING VISUAL SEARCH - A method comprises computing a color factor value indicating an amount of color gradients in at least one color channel from the query image. The method comprises combining the color-keypoints with the gray-keypoints when the color factor value is greater than a threshold. A method for performing a visual search comprises extracting a plurality of local descriptors from a query image and then selecting a subset of them based on various criteria such as a visual meaning score. A method comprises aggregating each mean vector for each visual codeword from distances between each visual codeword and local descriptors. The method comprises aggregating a variance vector for each visual codeword from the distance between each visual codeword and local descriptors. The method comprises transmitting aggregated mean vector information and aggregated variance vector information to a search server for efficient image retrieval. | 01-16-2014 |
20140185941 | ROBUST KEYPOINT FEATURE SELECTION FOR VISUAL SEARCH WITH SELF MATCHING SCORE - To improve feature selection accuracy during a visual search, interest points within a query image are two-way matched to features in an affine transformed image or otherwise transformed version of the query image. A user device implements a method for selecting local descriptors in the visual search. The method includes: detecting a first set of interest points for the original image; computing an affine transform matrix; computing a new image as a transformation of the original image using the affine transform matrix; detecting a second set of interest points from the new image; performing a two-way matching between the first set of interest points and the second set of interest points; sorting matching pairs according to a specified self-matching score (SMS); assigning an infinite value to the SMS of unmatched interest points from the original image; and selecting the interest points based on SMS. Significant performance gains in reducing false positive matches are achieved. | 07-03-2014 |
20140195560 | TWO WAY LOCAL FEATURE MATCHING TO IMPROVE VISUAL SEARCH ACCURACY - To improve precision of visual search processing, SIFT points within a query image are forward matched to features in each of a plurality of repository images and SIFT points within each repository image are backward matched to features within the query image. Forward-only, backward-only and forward-and-backward matches may be weighted differently in determining an image match. Two way matching may be triggered by query image bit rate in excess of a threshold or by a sum of weighted distances between matching points exceeding a threshold. Significant performance gains in eliminating false positive matches are achieved. | 07-10-2014 |
20140198998 | NOVEL CRITERIA FOR GAUSSIAN MIXTURE MODEL CLUSTER SELECTION IN SCALABLE COMPRESSED FISHER VECTOR (SCFV) GLOBAL DESCRIPTOR - A wireless communication device includes a processor configured to execute an image query. The image query utilizes cluster selection criteria for a cluster-aggregation based vectorization of a set of local features based on a quantity of top local features having the highest posteriori probability values. The cluster selection criterion is measured as the summation of the posteriori probability values of the top local features. The quantity of top local features is determined by a predetermined integer value greater than one. | 07-17-2014 |
20140201200 | VISUAL SEARCH ACCURACY WITH HAMMING DISTANCE ORDER STATISTICS LEARNING - Global descriptors for images within an image repository accessible to a visual search server are compared based on order statistics processing including sorting (which is a non-linear transform) and heat kernel matching. Affinity scores are computed for Hamming distances between Fisher vector components corresponding to different clusters of global descriptors from a pair of images and normalized to [0, 1], with zero affinity scores assigned to non-active cluster pairs. Linear Discriminant Analysis is employed to determine a sorted vector of affinity scores to obtain a new global descriptor. The resulting global descriptors produce significantly more accurate matching. | 07-17-2014 |
20140310314 | MATCHING PERFORMANCE AND COMPRESSION EFFICIENCY WITH DESCRIPTOR CODE SEGMENT COLLISION PROBABILITY OPTIMIZATION - A method and apparatus include extracting a global descriptor from a query image with a plurality of segments. The method also includes identifying segments with a desirable discriminating potential by analyzing data of the plurality of segments based on an available image database. The method also includes creating a bitmask where the identified segments are active. The method also includes masking any segment of the plurality of segments of the global descriptor that is inactive according to the bitmask. | 10-16-2014 |
20150016516 | METHOD FOR INTRA PREDICTION IMPROVEMENTS FOR OBLIQUE MODES IN VIDEO CODING - In various embodiments, a method and a decoder include identifying a directional intra prediction mode with an angle of prediction. The method also includes identifying first and second reference neighboring samples in a block of the video along the angle of prediction; the angle of prediction intersects a pixel to be predicted. The method further includes determining which of the first and second reference samples is nearest the angle of prediction and applying a value of the nearest reference neighboring sample to the pixel as a predictor. Also, a method and a decoder include determining whether a block type of a block of the video is intra block copy. The method also includes, responsive to the block type being the intra block copy, determining a transform block size of the block and, responsive to the transform block size being 4×4, applying a discrete sine transform to the block. | 01-15-2015 |
20150029394 | DISPLAY POWER REDUCTION USING HISTOGRAM METADATA - A method includes identifying an optimal backlight value for at least one quality level of a first video segment. The method also includes transmitting data for the first video segment. The transmitted data for the first video segment includes a message containing a first set of display adaptation information. The first set of display adaptation information includes the optimal backlight value for the at least one quality level of the first video segment. The method further includes identifying a backlight value for the at least one quality level of a second video segment. The method also includes determining a maximum backlight value change threshold between successive video segments. In addition, the method includes applying temporal smoothing between the optimal backlight value and the backlight value based on the maximum backlight value change threshold. | 01-29-2015 |
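The temporal smoothing step in the last entry (20150029394) can be illustrated with a minimal sketch. The abstract does not say how the smoothing is computed, so this assumes one simple reading: each segment's backlight value may differ from the previous segment's smoothed value by at most the maximum change threshold. The function name `smooth_backlight` and its parameters are hypothetical and do not come from the application.

```python
def smooth_backlight(values, max_change):
    """Limit the backlight change between successive video segments.

    Assumption (not stated in the abstract): smoothing is a simple clamp,
    so each value moves toward its target by at most max_change.
    """
    if max_change < 0:
        raise ValueError("max_change must be non-negative")
    smoothed = []
    for value in values:
        if not smoothed:
            # The first segment has no predecessor, so it is kept as-is.
            smoothed.append(value)
            continue
        previous = smoothed[-1]
        # Clamp the requested change to the range [-max_change, +max_change].
        delta = max(-max_change, min(max_change, value - previous))
        smoothed.append(previous + delta)
    return smoothed


if __name__ == "__main__":
    # Hypothetical per-segment backlight targets on a 0-255 scale.
    print(smooth_backlight([100, 40, 45, 90], max_change=20))
```

A clamp is only one possible choice; a weighted average of neighboring segments would also fit the abstract's wording, which leaves the smoothing rule open.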