| Method and device to determine a descriptor for a signal representing a multimedia item, device for retrieving items in a database, device for classification of multimedia items in a database -> Monitor Keywords |
|
Method and device to determine a descriptor for a signal representing a multimedia item, device for retrieving items in a database, device for classification of multimedia items in a databaseRelated Patent Categories: Data Processing: Database And File Management Or Data Structures, Database Schema Or Data Structure, Application Of Database Or Data Structure (e.g., Distributed, Multimedia, Image)The Patent Description & Claims data below is from USPTO Patent Application 20080086510. Brief Patent Description - Full Patent Description - Patent Application Claims FIELD OF THE INVENTION [0001] The invention concerns a method and a device to determine a descriptor for a multimedia item. The invention concerns also a device for retrieving multimedia items in a database and a device for classification of multimedia items in a database. BACKGROUND OF THE INVENTION [0002] In various fields of signal and data processing, e.g. in multimedia asset management, small-sized, compact descriptors are calculated for multimedia items in order to compare two items or to search items in a database similar to a given item. [0003] For instance, images in a database--e.g. personal photographs or images from a video--may have associated descriptors to ease database organization into groups of similar images and retrieval of images similar to a given one. [0004] A problem of descriptors is that they should best reflect similarity of two items while being small-sized. [0005] One type of known and commonly used descriptors is based on a frequency decomposition of the signal of the multimedia item. Therefore, a bank of filters is used to generate each a filtered signal corresponding to a frequency band. Then, often the power of the filtered signals in each band is calculated. The totality of power values builds the descriptor. The use of filter banks is common for example in audio processing. Also for images, filter banks such as wavelets or Gabor filter banks are widely used in image analysis and retrieval. [0006] In order to enhance the capacity of a descriptor to reflect the characteristics of images and the similarity of images, one of the following measures is commonly applied: [0007] 1. The number of filters in increased; [0008] 2. The repartition and type of filters is optimised; [0009] 3. The precision of each filter is increased. [0010] The first measure can be realised for example by taking 12 instead of 8 filters. By this, the signal's frequency spectrum is better described. [0011] The second measure can be realised--in the case of images--by replacing wavelet filters by Gabor filters. While wavelet filters cover the 2-dimensional frequency spectrum by considering horizontal, vertical and diagonal frequencies, Gabor filters are more flexible and can describe frequencies in more directions. Hereby, the images, and notably the texture in images, can be better described. [0012] The third measure addresses the implementation of filters, notably digital filters, and can be realized by increasing the number of samples used to represent the filter kernel. For example, a Gabor filter can be enhanced when replacing a 16.times.16 kernel by a 32.times.32 kernel. [0013] A problem of filter banks is often, that the spectrums of filters overlap and thus the frequency bands are not properly calculated. For example, Gabor filters have Gaussian-shaped spectra. These spectra do inherently overlap. This overlap lowers performance of image retrieval notably when one or several filters include considerable parts of frequency zero. [0014] Let us take as an example two images showing stripes. Direction and frequency of stripes is identical in both images. The only difference is a spatially constant offset between both images. We calculate a descriptor for each image based on the power of Gabor subbands. Even if the images show the same type of texture, the descriptors will be the more different the higher the offset is. [0015] Let us take another example of two images showing the same scene at different daytimes. The more different the illumination is the more different the descriptors will be. For example, images showing cars are searched in a database using a given image showing a car at daytime. Then, images showing cars at lower light levels such as in the evening may not be found. [0016] This effect makes the performance of retrieval in databases more difficult, notably when semantically similar items are searched. For example, audio clips are searched having a similar rhythm to a given one. When audio clips have different signal offsets by technical reasons, some audio clips with same rhythm but different offset may not be found. [0017] A negative effect can also occur when descriptors based on filter banks are used to classify multimedia items. Hereby, the descriptor is fed into a classifier that attributes one or several labels to the image. For example, a classifier for outdoor scenes in images can detect an outdoor scene in a given image and generate the label "outdoor" for this image. A classifier is usually trained by a set of typical images. When these images include only daylight images, the classifier may not detect outdoor scenes with lower light level, for example in the morning. SUMMARY OF THE INVENTION [0018] The invention proposes a method to calculate the descriptors of multimedia items by using bank filters and avoiding at least one of the above mentioned drawbacks. [0019] To this end, the invention proposes a method to determine a descriptor for a signal representing a multimedia item comprising the step of applying to the signal a first bank of directional filters in order to obtain a first set of coefficients. [0020] According to the invention, the method comprises the steps of: [0021] applying to the signal a second bank of filters in order to obtain a second set of coefficients representing the low-pass filtered signal, [0022] calculating a descriptor representing said multimedia element by making the difference between the first set of coefficients and the second set of coefficients and [0023] calculating associated power of the difference. [0024] According to a preferred embodiment, the directional filters are Gabor type filters. Continue reading... Full patent description for Method and device to determine a descriptor for a signal representing a multimedia item, device for retrieving items in a database, device for classification of multimedia items in a database Brief Patent Description - Full Patent Description - Patent Application Claims Click on the above for other options relating to this Method and device to determine a descriptor for a signal representing a multimedia item, device for retrieving items in a database, device for classification of multimedia items in a database patent application. ### 1. Sign up (takes 30 seconds). 2. Fill in the keywords to be monitored. 3. Each week you receive an email with patent applications related to your keywords. Start now! - Receive info on patent apps like Method and device to determine a descriptor for a signal representing a multimedia item, device for retrieving items in a database, device for classification of multimedia items in a database or other areas of interest. ### Previous Patent Application: Virtual interview system Next Patent Application: Methods and systems for providing fault recovery to side effects occurring during data processing Industry Class: Data processing: database and file management or data structures ### FreshPatents.com Support Thank you for viewing the Method and device to determine a descriptor for a signal representing a multimedia item, device for retrieving items in a database, device for classification of multimedia items in a database patent info. IP-related news and info Results in 4.21974 seconds Other interesting Feshpatents.com categories: Electronics: Semiconductor , Audio , Illumination , Connectors , Crypto , |
||