MULTI-OBJECT TRACKING USING STGMM_KLT METHOD

(1)

277 | P a g e

MULTI-OBJECT TRACKING USING STGMM_KLT

METHOD

S.Sathyanarayanan

1

, Dr. Anna Saro Vijendran

2

1

Department of Computer Science, PhD Research Scholar, Sri Ramakrishna College of Arts &

Science, Coimbatore.

2

Dean, School of Computing, Sri Ramakrishna College of Arts & Science, Coimbatore.

ABSTRACT

Segmentation and chase are two necessary aspects in visual scrutiny work systems. Several barriers like littered

background, camera movements, and occlusion create the strong detection and chase a tough drawback,

particularly just in case of multiple moving objects. Object detection within the presence of camera noise and

with variable or unfavorable brightness level conditions continues to be a lively space of analysis. This paper

proposes a framework which might effectively find the moving objects and track them despite of occlusion and a

priori information of objects within the scene. The segmentation step uses a strong threshold call formula that

uses a multi-background model. The video object chase is in a position to trace multiple objects beside their

trajectories supported Continuous Energy step-down. The projected technique combines extended Kalman filter

with past and color info for chase multiple objects below high occlusion. The projected technique is powerful to

background modeling technique. Object detection is completed victimisation spatio-temporal mathematician

mixture model (STGMM). Chase consists of 2 steps: partly occluded object chase and extremely occluded object

chase. Chase partly occluded objects, extended Kalman filter is exploited with past info of object, whereas for

extremely occluded object chase, color info and size attributes are used. The projected technique is assessed

quantitatively victimization the exactitude and recall accuracy metrics. Further, comparative analysis with

connected works has been dispensed to exhibit the effectiveness of the projected technique.

1. INTRODUCTION

Visual police investigation is one among the foremost active analysis areas in laptop vision. Associate in nursing

automatic visual closed-circuit television usually contains laptop vision tasks like motion segmentation, object

classification, tracking, recognition and high-level motion analysis [1, 2]. Motion segmentation task detects and

extracts moving objects from video frames. Object classification task categorizes the detected objects into

predefined categories like human, vehicle, animal, clutter, etc. Object pursuit task establishes the state of object

within the sequential frames of a video. Further, object pursuit will be performed on one object or multiple

(2)

278 | P a g e

objects among its Field Of read (FOV) and inter-camera approach uses multiple cameras to trace objects in

overlapping FOV [3]. Finally, the high-level motion analysis task acknowledges the objects and interprets their

activities. Object pursuit encompasses a wide range of applications like good video police investigation, traffic

video observance, accident prediction and detection, motion-based recognition, human laptop interaction,

human behavior understanding, etc. additionally, the constant threats from terrorist attacks at public and secured

places increase the requirement for economical police investigation systems with embedded object pursuit

subsystems. Further, the advancement in computing technology and convenience of top quality cameras at low

value has redoubled the employment of object pursuit in various applications [1-4].

Tracking algorithms estimate the thing motion. Trackers need solely data format, square measure quick and turn

out swish trajectories. On the opposite hand, they accumulate error throughout run-time (drift) and usually fail if

the thing disappears from the camera read. Analysis in pursuit aims at developing more and more strong trackers

that track “longer”. The post-failure behavior isn't directly addressed. Detection based mostly algorithms

estimate the thing location in each frame severally. Detectors don't drift and don't fail if the thing disappears

from the camera read. However, they need Associate in nursing offline coaching stage and thus cannot be

applied to unknown objects. This paper intends to:

1. Propose a brand new methodology which mixes multi-background registration based mostly object findion to

detect objects below dynamic backgrounds and pursuit supported continuous energy reduction.

2. To get higher results despite of occlusions in complicated backgrounds

II. LITERATURE REVIEW

Berclaz et al. [6] propose associate degree rule for frame-by-frame detection associate degreed linking the

trajectories of an unknown range of targets for multi-object following mistreatment K-shortest path

optimization. Zhai et al. [7] propose associate degree approach to trace associate degree object by a dynamic

model from a finite set of models. Because the single-model assumption may cause huntsman unstable if the

target has complicated flight or the camera has abrupt ego-motions.

Between objects underneath confused things. A following theme projected by Zulfiqar et al. [8] employs particle

filter and multi-mode anisotropic mean shift algorithms. They track the thing mistreatment solely fifteen

particles that increase the machine speed. The projected methodology eliminates the track drift and loss of track

throughout occlusion. a unique approach for sturdy object following has been brought by Li et al. [9]. They

track quite 3 occluded objects mistreatment dominant color bar graph. Moreover, the chosen colours are

supported the given distance measure that is additionally sturdy to illumination amendment.

Region based mostly following across 3 cameras mistreatment Kalman filter is projected in [7]. Girisha and

Murali [8, 9] adopted optical flow based mostly methodology for object following mistreatment two-way

ANOVA to match extracted options of video frames. However, the rule doesn't maintain the identity of the

(3)

279 | P a g e

are used for following is projected in [10]. A feature based mostly head huntsman that uses color info and

particle filter is projected in [11] for single object following. Another feature based mostly methodology that

uses the perimeters of object are projected in [12]. However, the strategy is meant just for following the face of

someone. Wang et al. [13] propose a scene accommodative feature based mostly methodology to trace multiple

objects.

Martin and Martinez [14] used color based mostly look, motion info and accommodative particle filter to trace

object. Kristan et al. [15] computed native motion of target and used color based mostly particle filter for

following objects. However, the model can't be used for all sensible applications because the object has got to be

hand-picked manually and is computationally overpriced. Learning based mostly methodology that learns color,

size and motion to trace objects across cameras mistreatment Kalman filter is projected in [16]. a well known

work particularly Hydra [17] tracks head by templet matching approach. A model based mostly methodology

that uses Principle axis of associate degree object and Homography constraints to match and track the thing

across completely different views of camera in Kalman filter framework is projected in [18]. However, the

strategy might not be applicable for sensible applications as landmarks on the bottom plane are needed and

therefore the system has the limitation to figure in controlled atmosphere. Xi et al. [19] propose a multiple

objects following rule within which object association in ordered frames relies on whole number programming

flow model. However, the rule can't be used for period state of affairs because the accuracy decreases in untidy

atmosphere. Jahandide et al. [20] propose associate degree rule to trace single moving object. They half-track

the thing mistreatment look and motion model mistreatment Kalman filter. However, the user must offer the

objects’ position within the 1st frame for following.

III.METHODOLOGY

Feature Extraction

There square measure numerous sorts of feature extraction with relation to satellite pictures. The similar options

along kind a feature vector to spot Associate in Nursing classify an object. Numerous feature extraction

techniques are explained intimately Color is one in every of the foremost vital options with the assistance of that

humans will simply acknowledge pictures. It’s most communicatory of all the visual options. It’s

straightforward to extract, Associate in Nursingalyze and represent an object. Attributable to their very little

linguistics which means and its compact illustration, color options tend to be a lot of domain freelance compared

to alternative options [8]. Its property of unchangingness with relation to the dimensions of the image and

orientation of objects on that build it an acceptable alternative for feature extraction in pictures. The standard of

feature vector depends for the most part on the colour house used for illustration. Color options square measure

described victimization color moments, fuzzy color moments, color bar graph etc. Therefore, it's a lot of

(4)

280 | P a g e

Color moments

Color distribution of pictures is often described effectively and expeditiously victimization color moments.

Color moments supply machine simplicity, speedy retrieval, and nominal storage [8]. This square measure

terribly strong to advanced background and freelance of image size and orientation [9]. Color moments feature

vector could be a terribly compact illustration as compared to alternative techniques attributable to that it's going

to even have lower discrimination power. Therefore, it is often used because the 1st pass to scale back the search

house. There square measure 1st order (mean), second order (variance) and third order (skewness) color

moments that square measure described as below.

Mean= 1/M*N P[i] [j]

Var=sqrt (1/M*N P[i] [j]-(mean). ^2

Where M and N are the image template’s height and width and P[i][j] are the pixel values.

Texture

Texture is one in all the vital options in image analysis for several applications. Texture analysis makes

an attempt to quantify intuitive qualities represented by terms like rough, smooth, silky, or jarring as a perform

of the special variation in constituent intensities. The selection of the textural options ought to be as compact as

attainable and nonetheless as discriminating as attainable [14]. It’s essential to search out a collection of texture

options with sensible discriminating power, so as to style associate degree economical algorithmic program for

texture classification. Texture options are often found exploitation strategies as Gabor Filter, Haar wave

Decomposition and wave GLCM etc.

Shape

Shape is one in all the foremost vital options in feature extraction. they're sometimes represented once

the image has been metameric into totally different regions or objects. Form description is often categorized into

either region based mostly} or boundary based. A decent form illustration feature for associate degree object

ought to be invariant to translation, rotation and scaling.

IV.PROPOSED WORK

The planned methodology caterpillar-tracked multiple objects in very scene exploitation EKF and once they

were occluded, color info was wont to decide between objects. Because the color info was integrated to Kalman

filtering, the planned methodology might with efficiency track multiple objects underneath high occlusion the

planned methodology. The planned methodology consists of 4 steps; background modeling, extended Kalman

(5)

281 | P a g e

Background Modeling during this step, we tend to review the STGMM planned by sol et al. [3]. The planned

methodology considers temporal behavior yet as special relations. Elaborate rationalization of the planned

STGMM is often reviewed in [10].

Extracting Dominant Color

When objects were occluded and shown as integrated blobs in STGMM, they were being treated as new objects

by the standard EKF. Therefore, we wanted some invariant attributes i.e., object category and object color, to

trace objects beneath high occlusion. The foreground extracted from STGMM was taken into thought. The RGB

values of every element of the upper-half of the thing were found and classified into n three categories,

wherever n equals sixteen. For every object the bar chart of most frequent color was found so taken as invariant

attribute of the thing. Then victimization Bhattacharyya distance, the objects before merging were compared to

the objects when the demerging to designate their objects IDs. The algorithms given in Fig. three clearly states

the foremost occurring color of the thing was extracted and wont to compare at demerging to designate chase

IDs properly. The thing disappeared and reappeared was conjointly tracked with single distinctive ID throughout

the scene.

EXPERIMENTAL RESULTS

The experimental results are bestowed that shows the nice chase of moving freelance and partial occluded

objects. The direction of object was maintained to recover it’s chase ID when partial merging and past data for

ten frames to re-track object showing when few frames by STGMM. The results are bestowed. Left [*fr1]

represents planned technique and right is its STGMM.

Figure show re-merged object

S.NO algorithm Accuracy precision Recall

1 CRF 90.6 86 87

2 GMM 93.4 89 89

(6)

282 | P a g e

This figure accuracy comparison of existing and proposed algorithms

This figure recall comparison of existing and proposed algorithms

(7)

283 | P a g e

V. CONCLUSION

Multi-target chase was done victimization EKF with past info of objects once there have been partly occluded

and conjointly disappeared and reappeared by poor background modeling. Once the objects were extremely

occluded, invariant attributes like color and size were integrated to EKF with past info to resolve chase

challenges. In future, we'll even be investigation the behavior and event of the item associated to chase.

Segmentation of horizontal incorporated objects in STGMM would be taken into thought. Also, we'd wish to

track multiple objects across the viewpoints victimization single dome camera supported Kalman prediction,

past and color info.

REFERENCES

[1] J. L. Xing, H. Z. Ai, L. W. Liu, and S. H. Lao, “Multiple player tracking in sports video: A dual-mode

two-way Bayesian inference approach with progressive observation modeling,” IEEE Transaction on Image

Processing, pp. 1652-1667, June 2011.

[2] Y. Benezeth, P. Jodoin, B. Emile, H. Laurent, and C. Rosenberger, “Review and evaluation of

commonly-implemented background subtraction algorithms,” in Proc. 19th International Conference on Pattern

Recognition, 2008, pp. 1-4.

[3] Y. S. Soh, Y. S. Hae, and I. Kim, “Spatio-temporal gaussian mixture model for background modeling,” in

Proc. 2012 IEEE International Symposium on Multimedia (ISM), Dec. 2012, pp. 360-363.

[4] S. J. Julier and J. K. Uhlmann, "Unscented filtering and nonlinear estimation,” in Proc. the IEEE, 2004, pp.

401–422.

[5] C. Y. Liu, P. L. Shui, and S. Li, "Unscented extended Kalman filter for target tracking," Journal of Systems

Engineering and Electronics, pp. 188-192, April 2011.

[6] J. Berclaz, F. Fleuret, E. Turetken, and P. Fua, “Multiple object tracking using K-Shortest paths

optimization,” IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 1806-1819, 2011.

[7] Y. Zhai, M. B. Yeary, S. Cheng, and N. Kehtarnavaz, "An object-tracking algorithm based on

multiple-model particle filtering with state partitioning," IEEE Transactions on Instrumentation and Measurement, pp.

1797-1809, 2009.

[8] Z. H. Khan, I. Y.-H. Gu, and A. G. Backhouse, "Robust visual object tracking using multi-mode anisotropic

mean shift and particle filters," IEEE Transactions on Circuits and Systems for Video Technology, pp. 74-87,

2011.

[9] L. Y. Li, W. M. Huang, I. Y.-H. Gu, R. J. Luo, and Q. Tian, "An efficient sequential approach to tracking

multiple objects through crowds for real-time intelligent CCTV systems," IEEE Transactions on Systems, Man,

and Cybernetics, Part B: Cybernetics, pp. 1254-1269, 2008.

(8)

284 | P a g e

[11] D N Verma, Sahil Narang, Bhawna Juneja, “Texture based Image Retrieval Using Correlation on Haar Wavelet Transform”, Janahan Lal Stephen (Ed.): CNC 2012, LNICST 2012, pp. 81–86. © Institute for

Computer Sciences, Social Informatics and Telecommunications Engineering 2012.

[12] Kekre, H. B., Sudeep D. Thepade, and Akshay Maloo. "Performance Comparison of Image Retrieval Using

Fractional Coefficients of Transformed Image Using DCT, Walsh, Haar and Kekre’s Transform."

CSC-International Journal of Image processing (IJIP) 4.2 (2010): 142-155.

[13] Hu, Ming-Kuei. "Visual pattern recognition by moment invariants." Information Theory, IRE Transactions

on 8.2 (1962): 179-187.

[14] Reddy, BV Ramana, et al. "Classification of Textures Based on Features Extracted from Preprocessing

Images on Random Windows." jiP 1 (2009):

[15] M. Kristan, J. Pers, S. Kovacic and A. Leonardis, A local-motion-based probabilistic model for visual

tracking, ELSEVIER, Pattern Recognition, vol. 40, pp. 2160–2168, 2009.

[16] A. Gilbert and R. Bowden, Incremental, scalable tracking of objects inter camera, ELSEVIER, Computer

Vision and Image Understanding, vol. 111, pp. 43-58, 2008.

[17] I. Haritaoglu, D. Harwood and L. S. Davis, Hydra: Multiple People Detection and tracking using

Silhouettes, IEEE, 1999.

[18] Dong-Wook Seo, Hyun-Uk Chae, Byeong-Woo Kim, Won-Ho Choi and Kang-Hyun Jo, Human Tracking

based on Multiple View Homography, Journal of Universal Computer Science, vol. 15, no. 13, pp. 2463-2484,

2009.

[19] Zhenghao Xi, Dongmei Xu, Wanqing Song, Yang Zheng, A* algorithm with dynamic weights for multiple

object tracking in video sequence, Optik - International Journal for Light and Electron Optics, Volume 126,

Issue 20, October 2015, Pages 2500-2507, ISSN 0030-4026, http://dx.doi.org/10.1016/j.ijleo.2015.06.020.

[20] Hamidreza Jahandide, Kamal Mohamedpour, Hamid Abrishami Moghaddam, A hybrid motion and

appearance prediction model for robust visual object tracking, Pattern Recognition Letters, Volume 33, Issue 16,