This technology enables automatic classification of audio-visual data for the internet and for search engines. A main challenge in the field of multimedia content analysis is the transformation of human interpretations of audio-visual data to correlating machine processable representations. This invention analyses such contents with the help of high-level audio-visual descriptors and classification methods. A classification accuracy (CA) of 95.6 % is achieved by using two hidden Markov models (HMM). For the distinction between the features ‘commercial’ and ‘non-commercial’ a CA of 98.4 % can be achieved by using two HMM. A decision tree results in a CA of 95.5 % for music videos and a CA of 91.9 % for news. When distinguishing between ‘sport’ and ‘non-sport’, Bayes’ Theorem achieves a CA of 95.2 %.

1.
Method for classifying a video sequence (VS), characterized by analyzing the video sequence using a plurality of genre-specific detector modules (M...

- Method for classifying a video sequence (VS), characterized by analyzing the video sequence using a plurality of genre-specific detector modules (M1-M5), each genre-specific detector module providing a probability value (P1-P5) indicating the probability that the video sequence belongs to the genre assigned to the genre-specific detector module; and analyzing the probability values of the plurality of genre-specific detector modules using a combiner (CM) which analyzes said probability values and generates a classification signal (SC) classifying the video sequence as belonging to a specific genre (g); wherein the probability values of the plurality of genre-specific detector modules are analyzed by a first evaluating unit (EU1) of said combiner, said first evaluating unit providing a first probability vector (V1) comprising for each genre a first probability value indicating the probability that the video sequence belongs to the respective genre, and a first preliminary decision (PD1) indicating which genre the video sequence presumably belongs to, the probability values of the plurality of genre-specific detector modules are further analyzed by a second evaluating unit (EU2) of said combiner, said second evaluating unit providing a second probability vector (V2) indicating for each genre a second probability value indicating the probability that the video sequence belongs to the respective genre, and a second preliminary decision (PD2) indicating which genre the video sequence presumably belongs to, wherein the first evaluating unit and the second evaluating unit differ in their analyzing algorithm; and said classification signal is generated based on an evaluation of said first and second preliminary decisions, wherein if the first and second preliminary decision indicate the same genre, a classification signal is generated which classifies the video sequence as belonging to said same genre, wherein if the first and second preliminary decision indicate different genres, the first and second probability vectors are further analyzed and the classification signal is generated based on the result of said analysis, wherein the step of generating said classification signal includes: adding the first and second probability vectors and generating a sum vector (Vsum), each coordinate of said sum vector being assigned to a specific genre; determining the highest coordinate value of the sum vector; determining the genre which is assigned to the coordinate with the highest coordinate value; and generating a classification signal which classifies the video sequence as belonging to the genre associated with the coordinate having the highest coordinate value.
- The method according to claim 1, wherein the step of generating said classification signal further includes: normalizing said vector sum; comparing the highest coordinate value of the normalized sum vector (Vnorm) to a reference value (Th); and generating a classification signal that indicates an unreliable classification result if the highest coordinate value is smaller than the reference value.
- Method for classifying a video sequence (VS), characterized by analyzing the video sequence using a plurality of genre-specific detector modules (M1-M5), each genre-specific detector module providing a probability value (P1-P5) indicating the probability that the video sequence belongs to the genre assigned to the genre-specific detector module; and analyzing the probability values of the plurality of genre-specific detector modules using a combiner (CM) which analyzes said probability values and generates a classification signal (SC) classifying the video sequence as belonging to a specific genre (g); wherein the probability values of the plurality of genre-specific detector modules are analyzed by a first evaluating unit (EU1) of said combiner, said first evaluating unit providing a first probability vector (V1) comprising for each genre a first probability value indicating the probability that the video sequence belongs to the respective genre, and a first preliminary decision (PD1) indicating which genre the video sequence presumably belongs to, the probability values of the plurality of genre-specific detector modules are further analyzed by a second evaluating unit (EU2) of said combiner, said second evaluating unit providing a second probability vector (V2) indicating for each genre a second probability value indicating the probability that the video sequence belongs to the respective genre, and a second preliminary decision (PD2) indicating which genre the video sequence presumably belongs to, wherein the first evaluating unit and the second evaluating unit differ in their analyzing algorithm; and said classification signal is generated based on an evaluation of said first and second preliminary decisions, wherein if the first and second preliminary decision indicate the same genre, a classification signal is generated which classifies the video sequence as belonging to said same genre, wherein if the first and second preliminary decision indicate different genres, the first and second probability vectors are further analyzed and the classification signal is generated based on the result of said analysis, wherein the step of generating said classification signal further includes: adding the first and second probability vectors and generating a sum vector, each coordinate of said sum vector being assigned to a specific genre; normalizing said sum vector; determining the highest coordinate value of the normalized sum vector; comparing said highest coordinate value of the normalized sum vector to a reference value; and generating a classification signal indicating an unreliable classification result if the highest coordinate value is smaller than the reference value.
- Method for classifying a video sequence (VS), characterized by analyzing the video sequence using a plurality of genre-specific detector modules (M1- M5), each genre-specific detector module providing a probability value (P1-P5) indicating the probability that the video sequence belongs to the genre assigned to the genre-specific detector module; and analyzing the probability values of the plurality of genre-specific detector modules using a combiner (CM) which analyzes said probability values and generates a classification signal (SC) classifying the video sequence as belonging to a specific genre (g); wherein the probability values of the plurality of genre-specific detector modules are analyzed by a first evaluating unit (EU1) of said combiner, said first evaluating unit providing a first probability vector (V1) comprising for each genre a first probability value indicating the probability that the video sequence belongs to the respective genre, and a first preliminary decision (PD1) indicating which genre the video sequence presumably belongs to, the probability values of the plurality of genre-specific detector modules are further analyzed by a second evaluating unit (EU2) of said combiner, said second evaluating unit providing a second probability vector (V2) indicating for each genre a second probability value indicating the probability that the video sequence belongs to the respective genre, and a second preliminary decision (PD2) indicating which genre the video sequence presumably belongs to, wherein the first evaluating unit and the second evaluating unit differ in their analyzing algorithm; and said classification signal is generated based on an evaluation of said first and second preliminary decisions, wherein the first evaluating unit of said combiner calculates said first probability vector based on a given product rule, wherein the first evaluating unit of said combiner calculates the first probability vector (V1) according to the following equation: V1=(P1*(1-P2)*…*(1-Pi)*…*(1-Pn)(1-P1)*P2*…*(1-Pi)*…*(1-Pn)(1-P1)*(1-P2)*…*Pi*…*(1-Pn)…(1-P1)*(1-P2)*…*(1-Pi)*…*Pn) wherein Pi (1≦i≦n) defines the probability value provided by the ith genre-specific detector module associated with the ith genre, and n defines the number of genres and genre-specific detector modules.
- Method for classifying a video sequence (VS), characterized by analyzing the video sequence using a plurality of genre-specific detector modules (M1-M5), each genre-specific detector module providing a probability value (P1-P5) indicating the probability that the video sequence belongs to the genre assigned to the genre-specific detector module; and analyzing the probability values of the plurality of genre-specific detector modules using a combiner (CM) which analyzes said probability values and generates a classification signal (SC) classifying the video sequence as belonging to a specific genre (g); wherein the probability values of the plurality of genre-specific detector modules are analyzed by a first evaluating unit (EU1) of said combiner, said first evaluating unit providing a first probability vector (V1) comprising for each genre a first probability value indicating the probability that the video sequence belongs to the respective genre, and a first preliminary decision (PD1) indicating which genre the video sequence presumably belongs to, the probability values of the plurality of genre-specific detector modules are further analyzed by a second evaluating unit (EU2) of said combiner, said second evaluating unit providing a second probability vector (V2) indicating for each genre a second probability value indicating the probability that the video sequence belongs to the respective genre, and a second preliminary decision (PD2) indicating which genre the video sequence presumably belongs to, wherein the first evaluating unit and the second evaluating unit differ in their analyzing algorithm; and said classification signal is generated based on an evaluation of said first and second preliminary decisions, wherein the second evaluating unit of said combiner calculates said second probability vector using a support vector machine, wherein the second evaluating unit of said combiner uses a support vector machine having a Radial Basis Function, RBF, as kernel function and/or a cost parameter between 30000 and 35000 and/or a γ-value of 8.

An exemplary embodiment of the invention relates to a method for classifying a video sequence (VS), characterized by the steps of analyzing the video ...

An exemplary embodiment of the invention relates to a method for classifying a video sequence (VS), characterized by the steps of analyzing the video sequence using a plurality of genre-specific detector modules (M1-M5), each genre-specific detector module providing a probability value (P1-P5) indicating the probability that the video sequence belongs to the genre assigned to the genre-specific detector module; and analyzing the probability values of the plurality of genre-specific detector modules using a combiner (CM) which analyzes said probability values and generates a classification signal (SC) classifying the video sequence as belonging to a specific genre (g).

read more
Write Review

WOW! | |
---|---|

GOTCHA! | |

DEAL! |