• No results found

Efficient Short Boundary Detection {&} Key Frame Extraction using Image Compression

N/A
N/A
Protected

Academic year: 2020

Share "Efficient Short Boundary Detection {&} Key Frame Extraction using Image Compression"

Copied!
6
0
0

Loading.... (view fulltext now)

Full text

(1)

Efficient Short Boundary Detection & Key Frame

Extraction using Image Compression

1. Shilpa.R.Jadhav

Department of Electronics and Communication TIT , Bhopal,

Bhopal, India

[email protected]

2. Anup.V. Kalaskar

Department of Electronics and Communication TIT, Bhopal

Bhopal, India

[email protected]

3. Prof. Shruti Bhargava

Department of Electronics and Communication TIT, Bhopal

Bhopal, India

[email protected]

AbstractThis paper present novel algorithm for efficient short boundary detection and key frames extraction using image compression. The algorithm differs from conventional methods mainly in the use of image segmentation and attention model.. Matching difference between two consecutive frames is computed with different weight. Shot boundaries are detected with automatic threshold. Key frame is extracted by using reference frame-based approach. Experimental results show improved performance of short boundary detection by using the proposed algorithms, and key frames represent shot content. And also satisfactorily image compression of result frame.

Keywords- shot boundary detection; x2 histogram; automatic threshold; key frame; image compression.

I. INTRODUCTION

The accurate segmentation of shots in a video sequence is fundamental and an essential functionality for numerous video retrieval and management tasks [1]. Many researchers have proposed algorithms to perform shot boundary detection based on certain features extracted from video frames, such as pixel differences, edge differences, color histograms, etc. Moreover, comparative surveys have also From a learning theory perspective, it is a natural approach to combine such promising features in order to decide whether a boundary exists or not within a given video sequence. Researchers have actively developed different approaches for intelligent video management, including shot transition detection, key frame extraction, video retrieval, etc.

Many approaches used different kinds of features to detect shot boundary, including histogram, shape

Information, and motion activity [2-5]. Among these approaches, histogram is the popular approach. However, in these histogram-based approaches, pixels’ space distribution was neglected. Different frames may have the same histogram [3-5]. In view of this, Change at al divided[6] each frame into r blocks, and the difference of the corresponding block of consecutive frame was computed by color histogram; the difference (i,i+1)of the two frames was obtained by adding up all the blocks’ difference; in the meanwhile, the difference

V(i.i+1)between two frames i and i+1 was measured

again without using blocks. Based on D(i.i+1) and V(i,i+1) short boundary was determined Getting over the drawback of the paper, we propose more efficient algorithms for shot boundary detection and key frame extraction with automatic threshold.

II. IDEA OF SHOT BOUNDARY DETECTION

A. Image Segmentation

Each frame is divided into blocks with mrow and n Colum. Then the difference of the Corresponding blocks between two consecutive frames is computed finally, the final difference of two frames is calculated by adding up all the difference through different weights

B. Attention Model

(2)

characteristic of pixel of different grey and the different importance of pixel of different position are considered.

C. Matching Difference

There are six kinds of histogram match [8]. Color histogram was used in computing g the matching difference in most literatures. However, through comparing several kinds of histogram matching methods, Nagasaki reached a conclusion that x2 histogram outperformed others in shot boundary recognition [8]. Hence, x2Histograms method is proposed in this method.

Step 5:

Shot boundary detection:

III. ALGORITHMS DESCRIPTION

Let F(k) be the k th frame in the video sequence

k=1,2…..Fv ( Fv denotes the total number of videos)

The algorithm of shot boundary detection is described as follows.

Algorithm 1:Shot boundary detection

Step 1:

Partitioning a frame into blocks with m rows and n columns, and B(i,j,k) stands for the block at

(i,j) in k th frame Step 2:

Computing x2 histogram matching difference between the corresponding blocks between consecutive frames in video sequence. H(i,j,k) and

H(i,j,k+1) stands for histogram of block at (i,j) in the k th

and k+1th frame respectively. Block’s difference is measured by the following equation:

Let threshold T=M D +a × STD. Shot candidate detection if D (i,i+1 )≥T ,the ith frame is the end frame

of previous shot, and the (i+1)th frame is the end frame of next shot.

Final shot detection:

shots may be very long but not much short, because those shots with only several frames cannot be captured by people and they cannot convey a whole message. Usually, a shortest shot should last for 1 to 2.5 s. For the reason of fluency, frame rate is at least 25 fps, (it is 30 fps in most cases), or flash will appear. So, a shot contains at least a minimum number of 30 to 45 frames. In our experiment, video sequences are downs amp led at 10 fp s to improve simulation speed. On this condition, the shortest shot should contain 10 to 15 frames. 13 is selected for our experiment. We formulate a “shots merging principle”: if a detected shot contain fewer frames than 13 frames, it will be merged into previous shot, or it will be thought as an independent one.

Definition 1:

where L is the number of gray in an image;

Step 3:

Reference Frame: it is the first frame of each shot; General Frames: all the frames except for reference frame; “Shot Dynamic Factor” max(i):the maximum x2Histogram within shot i ;

Computing x2 Histogram difference between two consecutive frames

Where Wij stands for the weight of block at (i,j);

Step 4:

Dynamic Shot and Static Shot:

A shot will be declared as dynamic shot, if its max(i) is bigger than MD ; otherwise it is shot; Fc(k) : The k the frame within the current shot, k=1,2,3…………FCN (k) (FCN (k) is the total number of current shot.)

Computing threshold automatically: Computing the mean and standard variance of x2 Histograms difference over the whole video sequence [7]. Mean and standard variance are defined

as follows.

The algorithm of key frame extraction is described as follows.

Algorithm 2: Key frame extraction :

Step 1:

(3)

Frame 14 Block Difference:9.570712e+005

Step 2:

a shot:

Step 3:

Searching for the maximum difference within

Determining “Shot Type” according to the

Frame 15 Block Difference:2.300548e+006

Frame 16 Block Difference:2.232479e+006

Frame 17 Block Difference:1.643245e+006

Frame 18 Block Difference:3.115060e+007

Frame 19 Block Difference:2.238738e+006

Frame 20 Block Difference:4.988129e+006 relationship between max(i) and MD: Static Shot(0) or

Dynamic Shot: Frame 21 Block Difference:3.637578e+007

Frame 22 Block Difference:9.083900e+006

Step 4:

Determining the position of key frame: if

Frame 23 Block Difference:4.059998e+006

Frame 24 Block Difference:6.239042e+006

Shot Type C=0, with respect to the odd number of a shot’s frames, the frame in the middle of shot is chose as key frame; in the case of the even number, any one frame between the two frames in the middle of shot can be chose as key frame. If Shot Type C=1, the frame with the maximum difference is declared as key frame.

IV. EXPERIMENT RESULTS AND ANALYSIS

These algorithms are tested with shots of different

Videos and found result follows:

Shot Boundary Output

Frame 1 Block Difference:2.252734e+005

Frame 2 Block Difference:3.029871e+005

Frame 3 Block Difference:1.740808e+005

Frame 4 Block Difference:4.918616e+005

Frame 5 Block Difference:2.299742e+005

Frame 6 Block Difference:9.655472e+005

Frame 7 Block Difference:2.420272e+005

Frame 8 Block Difference:3.452967e+005

Frame 9 Block Difference:2.035338e+005

Frame 10 Block Difference:2.138207e+005

Frame 11 Block Difference:2.306950e+005

Frame 12 Block Difference:1.043614e+006

Frame 13 Block Difference:1.545241e+006

Frame 25 Block Difference:3.184043e+006

Frame 26 Block Difference:2.128465e+007

Frame 27 Block Difference:4.234818e+006

Frame 28 Block Difference:1.517516e+007

Frame 29 Block Difference:9.038847e+005

Frame 30 Block Difference:9.291212e+005

Frame 31 Block Difference:1.816369e+006

Frame 32 Block Difference:1.377436e+006

Frame 33 Block Difference:2.366793e+006

Frame 34 Block Difference:2.499197e+006

Frame 35 Block Difference:4.265971e+006

Frame 36 Block Difference:4.664410e+006

Frame 37 Block Difference:1.645068e+007

Frame 38 Block Difference:2.491203e+007

Frame 39 Block Difference:2.655367e+007

Frame 40 Block Difference:3.462600e+007

Frame 41 Block Difference:6.263576e+006

Frame 42 Block Difference:3.812128e+007

Frame 43 Block Difference:3.823555e+007

(4)

Frame 45 Block Difference:3.445289e+007

Frame 46 Block Difference:3.606135e+007

Frame 47 Block Difference:1.312662e+006

Frame 48 Block Difference:2.046394e+006

Frame 49 Block Difference:2.479815e+006

Frame 50 Block Difference:3.438300e+006

Frame 51 Block Difference:1.741411e+006

Frame 52 Block Difference:1.660074e+006

Frame 53 Block Difference:1.407927e+006

Frame 54 Block Difference:1.806781e+006

Frame 55 Block Difference:1.556794e+006

Frame 56 Block Difference:6.257250e+005

Frame 57 Block Difference:4.741886e+005

Frame 58 Block Difference:8.067111e+005

Frame 59 Block Difference:8.242188e+005

Frame 60 Block Difference:1.019812e+006

Frame 61 Block Difference:1.279052e+006

Frame 62 Block Difference:1.089395e+006

Frame 63 Block Difference:1.080301e+006

Frame 64 Block Difference:1.494956e+006

Frame 65 Block Difference:1.529653e+007

Frame 66 Block Difference:2.561012e+006

Frame 67 Block Difference:3.227772e+006

Frame 68 Block Difference:3.426323e+006

Frame 69 Block Difference:4.467072e+006

Frame 70 Block Difference:5.090183e+006

Frame 71 Block Difference:6.058270e+006

Frame 72 Block Difference:4.459274e+006

Frame 73 Block Difference:1.645334e+007

Frame 74 Block Difference:2.056309e+007

Frame 75 Block Difference:6.206481e+006

Frame 76 Block Difference:2.683221e+007

Frame 77 Block Difference:3.083760e+007

Frame 78 Block Difference:2.422882e+007

Frame 79 Block Difference:3.405632e+006

Frame 80 Block Difference:2.789725e+007

Frame 81 Block Difference:3.033992e+007

Frame 82 Block Difference:9.515336e+005

Frame 83 Block Difference:9.183184e+005

Frame 84 Block Difference:6.327894e+005

Frame 85 Block Difference:7.926676e+005

Frame 86 Block Difference:7.424013e+005

Frame 87 Block Difference:7.223795e+005

Frame 88 Block Difference:7.930851e+005

Frame 89 Block Difference:1.136819e+006

Frame 90 Block Difference:2.802542e+006

Frame 91 Block Difference:3.241871e+006

Frame 92 Block Difference:3.743190e+006

Frame 93 Block Difference:2.640398e+006

Frame 94 Block Difference:2.253193e+006

Frame 95 Block Difference:3.129525e+006

Frame 96 Block Difference:1.731097e+007

Frame 97 Block Difference:6.908658e+006

Frame 98 Block Difference:2.756800e+007

Frame 99 Block Difference:6.642552e+006

Frame 100 Block Difference:1.704278e+007

Frame 101 Block Difference:1.828782e+006

Frame 102 Block Difference:4.445243e+007

Frame 103 Block Difference:2.719461e+007

Frame 104 Block Difference:2.154940e+006

Frame 105 Block Difference:7.612865e+005

(5)

Frame 107 Block Difference:6.521334e+005

Frame 108 Block Difference:1.169921e+006

Frame 109 Block Difference:1.626765e+006

Frame 110 Block Difference:1.856963e+006

Frame 111 Block Difference:1.327165e+006

Frame 112 Block Difference:1.612671e+006

Frame 113 Block Difference:2.237268e+006

Frame 114 Block Difference:3.684151e+006

Frame 115 Block Difference:3.752450e+006

Frame 116 Block Difference:1.805379e+006

Frame 117 Block Difference:1.434608e+006

Frame 118 Block Difference:1.498452e+006

Frame 119 Block Difference:1.359151e+006

Mean:7711831.886797,

Std Deviation:11187076.248366

Threshold:18898908.135163

Extracted Key Frames

Writing key frame to file

Frame 18 has a block difference threshold

Writing key frame to file

Frame 21 has a block difference threshold

Writing key frame to file

Frame 26 has a block difference threshold

Writing key frame to file

Frame 38 has a block difference threshold

Writing key frame to file

Frame 39 has a block difference threshold

Writing key frame to file

Frame 40 has a block difference threshold

Writing key frame to file

Frame 42 has a block difference threshold

Writing key frame to file

Frame 43 has a block difference threshold

Writing key frame to file

Frame 44 has a block difference threshold

Writing key frame to file

Frame 45 has a block difference threshold

Writing key frame to file

Frame 46 has a block difference threshold

Writing key frame to file

Frame 74 has a block difference threshold

Writing key frame to file

Frame 76 has a block difference threshold

Writing key frame to file

Frame 77 has a block difference threshold

Writing key frame to file

Frame 78 has a block difference threshold

Writing key frame to file

Frame 80 has a block difference threshold

Writing key frame to file

Frame 81 has a block difference threshold

Writing key frame to file

Frame 98 has a block difference threshold

(6)

V. CONCLUSION

We proposed new model to reduce the boundary of key frame and key frame extraction system based mage segmentation and attention model. We first segmented each frame, and then employed different weights to compute the matching difference and threshold. By using the automatic threshold, we detected boundary, and by using image compression image get compress. Next, reference frame-based approach was provided to extract key frame. Our experimental results show that the proposed boundary detection algorithm can provide accurate and reduced boundary result The extracted key frame scan satisfactorily represent the content of video In order to further improve accuracy, we will use multimodal information to segment video.

REFERENCES

[1] L. Zhu, M .S Guido, and Aggelos K.K., “MIMAX optimal video summarization”, IEEE Transaction on Circuits and System for Video Technology, Vol.15, No.10, October 2005, pp. 1245-1256 .

[2] A.Hanjalic, “Shot-boundary detection: Unraveledand resolved?”, IEEE Transaction on Circuits and System for Video Technology., Vol.12, No.2, February, 2002, pp. 90-105

[3] Z.Cern ekova, I. Pitas, and C Nikou, “Information theory-based shot cut/fade detection and video summarization”, IEEE Transaction on Circuits and System for Video Technology, Vol.16, No.1 January 2006, pp. 82-91

[4] N. Babaguchi, Y. Kawai,T. Ogura, and T. Kitahashi“Personalized abstraction of broadcasted American

football video b y highlight selection”, IEEE Transaction On Multimedia, Vol.6,No.4, August 2004, pp. 575-586

[5] A. Hanjalic, “Multimodal approach tomeasuring excitement in video”, Proceedings of International Conference on Multimedia and Expo ICME 03[C] Vol.2, July 2003, pp. 289-292

[6] Y. Chen g, X. Yang, and D. Xu,“A method for shot boundary detection with automatic threshold”, TENCON’02. Proceedings. 2002 IEEE Region10conferenceonComputers, Communication, Control and Power Engineering[C], Vol.1, October 2002

[7] Y. Zhuang, Y. Rui, T.S. Huang, and S. Mehrotra“Adaptive key frame extraction using unsupervised clustering”, Proceeding.ICIP’98[C], Chicago, IL, 1998, Vol.1, pp. 866-870.

References

Related documents

The quality of freshly salvaged red blood cells is at least comparable with stored red blood cells [ 62 ]. An absolute contraindication to indirect autologous transfusion is

The comparison suggests in 2003/04 smallholder profits from growing burley were limited by two main factors: first, the practices of leaf merchant companies on the auction floors

BioMed Central World Journal of Surgical Oncology ss Open AcceCase report Serous adenocarcinoma of the fallopian tube, associated with verrocous carcinoma of the uterine cervix a

Tong et al EURASIP Journal on Wireless Communications and Networking 2014, 2014 150 http //jwcn eurasipjournals com/content/2014/1/150 RESEARCH Open Access Compressive sensing image

By measuring geometric errors of the finished test piece, the position and the orientation of rotary axis aver- age lines (location errors), as well as position-dependent

We examine relationships between fame and premature mortality and test how such relationships vary with type of performer (eg, solo or band member) and nationality and whether cause

NCRP will apply the tool in multiple sites, report the results, engage funders in dialogue about the findings at local events, connect interested funders to peers and experts

In mild of the proposed revocable IBE make with CRA, we built up a CR Aided affirmation plot with length- restrained offers for adapting to a popular