277 | P a g e
MULTI-OBJECT TRACKING USING STGMM_KLT
METHOD
S.Sathyanarayanan
1, Dr. Anna Saro Vijendran
21
Department of Computer Science, PhD Research Scholar, Sri Ramakrishna College of Arts &
Science, Coimbatore.
2
Dean, School of Computing, Sri Ramakrishna College of Arts & Science, Coimbatore.
ABSTRACT
Segmentation and chase are two necessary aspects in visual scrutiny work systems. Several barriers like littered
background, camera movements, and occlusion create the strong detection and chase a tough drawback,
particularly just in case of multiple moving objects. Object detection within the presence of camera noise and
with variable or unfavorable brightness level conditions continues to be a lively space of analysis. This paper
proposes a framework which might effectively find the moving objects and track them despite of occlusion and a
priori information of objects within the scene. The segmentation step uses a strong threshold call formula that
uses a multi-background model. The video object chase is in a position to trace multiple objects beside their
trajectories supported Continuous Energy step-down. The projected technique combines extended Kalman filter
with past and color info for chase multiple objects below high occlusion. The projected technique is powerful to
background modeling technique. Object detection is completed victimisation spatio-temporal mathematician
mixture model (STGMM). Chase consists of 2 steps: partly occluded object chase and extremely occluded object
chase. Chase partly occluded objects, extended Kalman filter is exploited with past info of object, whereas for
extremely occluded object chase, color info and size attributes are used. The projected technique is assessed
quantitatively victimization the exactitude and recall accuracy metrics. Further, comparative analysis with
connected works has been dispensed to exhibit the effectiveness of the projected technique.
1. INTRODUCTION
Visual police investigation is one among the foremost active analysis areas in laptop vision. Associate in nursing
automatic visual closed-circuit television usually contains laptop vision tasks like motion segmentation, object
classification, tracking, recognition and high-level motion analysis [1, 2]. Motion segmentation task detects and
extracts moving objects from video frames. Object classification task categorizes the detected objects into
predefined categories like human, vehicle, animal, clutter, etc. Object pursuit task establishes the state of object
within the sequential frames of a video. Further, object pursuit will be performed on one object or multiple
278 | P a g e
objects among its Field Of read (FOV) and inter-camera approach uses multiple cameras to trace objects in
overlapping FOV [3]. Finally, the high-level motion analysis task acknowledges the objects and interprets their
activities. Object pursuit encompasses a wide range of applications like good video police investigation, traffic
video observance, accident prediction and detection, motion-based recognition, human laptop interaction,
human behavior understanding, etc. additionally, the constant threats from terrorist attacks at public and secured
places increase the requirement for economical police investigation systems with embedded object pursuit
subsystems. Further, the advancement in computing technology and convenience of top quality cameras at low
value has redoubled the employment of object pursuit in various applications [1-4].
Tracking algorithms estimate the thing motion. Trackers need solely data format, square measure quick and turn
out swish trajectories. On the opposite hand, they accumulate error throughout run-time (drift) and usually fail if
the thing disappears from the camera read. Analysis in pursuit aims at developing more and more strong trackers
that track “longer”. The post-failure behavior isn't directly addressed. Detection based mostly algorithms
estimate the thing location in each frame severally. Detectors don't drift and don't fail if the thing disappears
from the camera read. However, they need Associate in nursing offline coaching stage and thus cannot be
applied to unknown objects. This paper intends to:
1. Propose a brand new methodology which mixes multi-background registration based mostly object findion to
detect objects below dynamic backgrounds and pursuit supported continuous energy reduction.
2. To get higher results despite of occlusions in complicated backgrounds
II. LITERATURE REVIEW
Berclaz et al. [6] propose associate degree rule for frame-by-frame detection associate degreed linking the
trajectories of an unknown range of targets for multi-object following mistreatment K-shortest path
optimization. Zhai et al. [7] propose associate degree approach to trace associate degree object by a dynamic
model from a finite set of models. Because the single-model assumption may cause huntsman unstable if the
target has complicated flight or the camera has abrupt ego-motions.
Between objects underneath confused things. A following theme projected by Zulfiqar et al. [8] employs particle
filter and multi-mode anisotropic mean shift algorithms. They track the thing mistreatment solely fifteen
particles that increase the machine speed. The projected methodology eliminates the track drift and loss of track
throughout occlusion. a unique approach for sturdy object following has been brought by Li et al. [9]. They
track quite 3 occluded objects mistreatment dominant color bar graph. Moreover, the chosen colours are
supported the given distance measure that is additionally sturdy to illumination amendment.
Region based mostly following across 3 cameras mistreatment Kalman filter is projected in [7]. Girisha and
Murali [8, 9] adopted optical flow based mostly methodology for object following mistreatment two-way
ANOVA to match extracted options of video frames. However, the rule doesn't maintain the identity of the
279 | P a g e
are used for following is projected in [10]. A feature based mostly head huntsman that uses color info and
particle filter is projected in [11] for single object following. Another feature based mostly methodology that
uses the perimeters of object are projected in [12]. However, the strategy is meant just for following the face of
someone. Wang et al. [13] propose a scene accommodative feature based mostly methodology to trace multiple
objects.
Martin and Martinez [14] used color based mostly look, motion info and accommodative particle filter to trace
object. Kristan et al. [15] computed native motion of target and used color based mostly particle filter for
following objects. However, the model can't be used for all sensible applications because the object has got to be
hand-picked manually and is computationally overpriced. Learning based mostly methodology that learns color,
size and motion to trace objects across cameras mistreatment Kalman filter is projected in [16]. a well known
work particularly Hydra [17] tracks head by templet matching approach. A model based mostly methodology
that uses Principle axis of associate degree object and Homography constraints to match and track the thing
across completely different views of camera in Kalman filter framework is projected in [18]. However, the
strategy might not be applicable for sensible applications as landmarks on the bottom plane are needed and
therefore the system has the limitation to figure in controlled atmosphere. Xi et al. [19] propose a multiple
objects following rule within which object association in ordered frames relies on whole number programming
flow model. However, the rule can't be used for period state of affairs because the accuracy decreases in untidy
atmosphere. Jahandide et al. [20] propose associate degree rule to trace single moving object. They half-track
the thing mistreatment look and motion model mistreatment Kalman filter. However, the user must offer the
objects’ position within the 1st frame for following.
III.METHODOLOGY
Feature Extraction
There square measure numerous sorts of feature extraction with relation to satellite pictures. The similar options
along kind a feature vector to spot Associate in Nursing classify an object. Numerous feature extraction
techniques are explained intimately Color is one in every of the foremost vital options with the assistance of that
humans will simply acknowledge pictures. It’s most communicatory of all the visual options. It’s
straightforward to extract, Associate in Nursingalyze and represent an object. Attributable to their very little
linguistics which means and its compact illustration, color options tend to be a lot of domain freelance compared
to alternative options [8]. Its property of unchangingness with relation to the dimensions of the image and
orientation of objects on that build it an acceptable alternative for feature extraction in pictures. The standard of
feature vector depends for the most part on the colour house used for illustration. Color options square measure
described victimization color moments, fuzzy color moments, color bar graph etc. Therefore, it's a lot of
280 | P a g e
Color moments
Color distribution of pictures is often described effectively and expeditiously victimization color moments.
Color moments supply machine simplicity, speedy retrieval, and nominal storage [8]. This square measure
terribly strong to advanced background and freelance of image size and orientation [9]. Color moments feature
vector could be a terribly compact illustration as compared to alternative techniques attributable to that it's going
to even have lower discrimination power. Therefore, it is often used because the 1st pass to scale back the search
house. There square measure 1st order (mean), second order (variance) and third order (skewness) color
moments that square measure described as below.
Mean= 1/M*N P[i] [j]
Var=sqrt (1/M*N P[i] [j]-(mean). ^2
Where M and N are the image template’s height and width and P[i][j] are the pixel values.
Texture
Texture is one in all the vital options in image analysis for several applications. Texture analysis makes
an attempt to quantify intuitive qualities represented by terms like rough, smooth, silky, or jarring as a perform
of the special variation in constituent intensities. The selection of the textural options ought to be as compact as
attainable and nonetheless as discriminating as attainable [14]. It’s essential to search out a collection of texture
options with sensible discriminating power, so as to style associate degree economical algorithmic program for
texture classification. Texture options are often found exploitation strategies as Gabor Filter, Haar wave
Decomposition and wave GLCM etc.
Shape
Shape is one in all the foremost vital options in feature extraction. they're sometimes represented once
the image has been metameric into totally different regions or objects. Form description is often categorized into
either region based mostly} or boundary based. A decent form illustration feature for associate degree object
ought to be invariant to translation, rotation and scaling.
IV.PROPOSED WORK
The planned methodology caterpillar-tracked multiple objects in very scene exploitation EKF and once they
were occluded, color info was wont to decide between objects. Because the color info was integrated to Kalman
filtering, the planned methodology might with efficiency track multiple objects underneath high occlusion the
planned methodology. The planned methodology consists of 4 steps; background modeling, extended Kalman
281 | P a g e
Background Modeling during this step, we tend to review the STGMM planned by sol et al. [3]. The planned
methodology considers temporal behavior yet as special relations. Elaborate rationalization of the planned
STGMM is often reviewed in [10].
Extracting Dominant Color
When objects were occluded and shown as integrated blobs in STGMM, they were being treated as new objects
by the standard EKF. Therefore, we wanted some invariant attributes i.e., object category and object color, to
trace objects beneath high occlusion. The foreground extracted from STGMM was taken into thought. The RGB
values of every element of the upper-half of the thing were found and classified into n three categories,
wherever n equals sixteen. For every object the bar chart of most frequent color was found so taken as invariant
attribute of the thing. Then victimization Bhattacharyya distance, the objects before merging were compared to
the objects when the demerging to designate their objects IDs. The algorithms given in Fig. three clearly states
the foremost occurring color of the thing was extracted and wont to compare at demerging to designate chase
IDs properly. The thing disappeared and reappeared was conjointly tracked with single distinctive ID throughout
the scene.
EXPERIMENTAL RESULTS
The experimental results are bestowed that shows the nice chase of moving freelance and partial occluded
objects. The direction of object was maintained to recover it’s chase ID when partial merging and past data for
ten frames to re-track object showing when few frames by STGMM. The results are bestowed. Left [*fr1]
represents planned technique and right is its STGMM.
Figure show re-merged object
S.NO algorithm Accuracy precision Recall
1 CRF 90.6 86 87
2 GMM 93.4 89 89
282 | P a g e
This figure accuracy comparison of existing and proposed algorithms
This figure recall comparison of existing and proposed algorithms
283 | P a g e
V. CONCLUSION
Multi-target chase was done victimization EKF with past info of objects once there have been partly occluded
and conjointly disappeared and reappeared by poor background modeling. Once the objects were extremely
occluded, invariant attributes like color and size were integrated to EKF with past info to resolve chase
challenges. In future, we'll even be investigation the behavior and event of the item associated to chase.
Segmentation of horizontal incorporated objects in STGMM would be taken into thought. Also, we'd wish to
track multiple objects across the viewpoints victimization single dome camera supported Kalman prediction,
past and color info.
REFERENCES
[1] J. L. Xing, H. Z. Ai, L. W. Liu, and S. H. Lao, “Multiple player tracking in sports video: A dual-mode
two-way Bayesian inference approach with progressive observation modeling,” IEEE Transaction on Image
Processing, pp. 1652-1667, June 2011.
[2] Y. Benezeth, P. Jodoin, B. Emile, H. Laurent, and C. Rosenberger, “Review and evaluation of
commonly-implemented background subtraction algorithms,” in Proc. 19th International Conference on Pattern
Recognition, 2008, pp. 1-4.
[3] Y. S. Soh, Y. S. Hae, and I. Kim, “Spatio-temporal gaussian mixture model for background modeling,” in
Proc. 2012 IEEE International Symposium on Multimedia (ISM), Dec. 2012, pp. 360-363.
[4] S. J. Julier and J. K. Uhlmann, "Unscented filtering and nonlinear estimation,” in Proc. the IEEE, 2004, pp.
401–422.
[5] C. Y. Liu, P. L. Shui, and S. Li, "Unscented extended Kalman filter for target tracking," Journal of Systems
Engineering and Electronics, pp. 188-192, April 2011.
[6] J. Berclaz, F. Fleuret, E. Turetken, and P. Fua, “Multiple object tracking using K-Shortest paths
optimization,” IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 1806-1819, 2011.
[7] Y. Zhai, M. B. Yeary, S. Cheng, and N. Kehtarnavaz, "An object-tracking algorithm based on
multiple-model particle filtering with state partitioning," IEEE Transactions on Instrumentation and Measurement, pp.
1797-1809, 2009.
[8] Z. H. Khan, I. Y.-H. Gu, and A. G. Backhouse, "Robust visual object tracking using multi-mode anisotropic
mean shift and particle filters," IEEE Transactions on Circuits and Systems for Video Technology, pp. 74-87,
2011.
[9] L. Y. Li, W. M. Huang, I. Y.-H. Gu, R. J. Luo, and Q. Tian, "An efficient sequential approach to tracking
multiple objects through crowds for real-time intelligent CCTV systems," IEEE Transactions on Systems, Man,
and Cybernetics, Part B: Cybernetics, pp. 1254-1269, 2008.
284 | P a g e
[11] D N Verma, Sahil Narang, Bhawna Juneja, “Texture based Image Retrieval Using Correlation on Haar Wavelet Transform”, Janahan Lal Stephen (Ed.): CNC 2012, LNICST 2012, pp. 81–86. © Institute for
Computer Sciences, Social Informatics and Telecommunications Engineering 2012.
[12] Kekre, H. B., Sudeep D. Thepade, and Akshay Maloo. "Performance Comparison of Image Retrieval Using
Fractional Coefficients of Transformed Image Using DCT, Walsh, Haar and Kekre’s Transform."
CSC-International Journal of Image processing (IJIP) 4.2 (2010): 142-155.
[13] Hu, Ming-Kuei. "Visual pattern recognition by moment invariants." Information Theory, IRE Transactions
on 8.2 (1962): 179-187.
[14] Reddy, BV Ramana, et al. "Classification of Textures Based on Features Extracted from Preprocessing
Images on Random Windows." jiP 1 (2009):
[15] M. Kristan, J. Pers, S. Kovacic and A. Leonardis, A local-motion-based probabilistic model for visual
tracking, ELSEVIER, Pattern Recognition, vol. 40, pp. 2160–2168, 2009.
[16] A. Gilbert and R. Bowden, Incremental, scalable tracking of objects inter camera, ELSEVIER, Computer
Vision and Image Understanding, vol. 111, pp. 43-58, 2008.
[17] I. Haritaoglu, D. Harwood and L. S. Davis, Hydra: Multiple People Detection and tracking using
Silhouettes, IEEE, 1999.
[18] Dong-Wook Seo, Hyun-Uk Chae, Byeong-Woo Kim, Won-Ho Choi and Kang-Hyun Jo, Human Tracking
based on Multiple View Homography, Journal of Universal Computer Science, vol. 15, no. 13, pp. 2463-2484,
2009.
[19] Zhenghao Xi, Dongmei Xu, Wanqing Song, Yang Zheng, A* algorithm with dynamic weights for multiple
object tracking in video sequence, Optik - International Journal for Light and Electron Optics, Volume 126,
Issue 20, October 2015, Pages 2500-2507, ISSN 0030-4026, http://dx.doi.org/10.1016/j.ijleo.2015.06.020.
[20] Hamidreza Jahandide, Kamal Mohamedpour, Hamid Abrishami Moghaddam, A hybrid motion and
appearance prediction model for robust visual object tracking, Pattern Recognition Letters, Volume 33, Issue 16,