net/publication/261453053
CITATION READS
1 40
2 authors:
Some of the authors of this publication are also working on these related projects:
Development of a System for Detecting Security Threats for Persons with a Vision Impairment, Including Threats Caused by Traffic. Legal and Technical Aspects. View project
All content following this page was uploaded by Mikołaj Igor Leszczuk on 07 April 2019.
Abstract
The Quality of Experience (QoE) concept for video content used for entertain-
ment differs significantly from the QoE of surveillance video used for recognition
tasks. Consequently, the latter requires specific reference video database sets for
research. In this paper, such a database is described. The paper reports the process
of preparing test video sequences and related subjective scores. It also presents
methods used to form visual sequences and collections of accompanying subjective
scores. The resulting data is freely available (via a public library) to the research
community, which is common practice in research.
CCTV, free content database, objective evaluation, recognition video quality, coding
impairments
1 Introduction
Users of video to perform tasks require sufficient video quality to recognize the in-
formation needed for their application. Therefore, the fundamental measure of video
quality in these applications is the success rate of these recognition tasks, which is
referred to as visual intelligibility or acuity. One of the major causes of reduction of
visual intelligibility is loss of data through various forms of compression. Additionally,
the characteristics of the scene being captured have a direct effect on visual intelligi-
bility and on the performance of a compression operation-specifically, the size of the
target of interest, the lighting conditions, and the temporal complexity of the scene
[1]. For example, low resolution of images can have influence on reliability of face
detection and recognition [2].
Consequently, there is a need for developments in quality assessment for target
recognition video, which is including performed series of tests to study the effects and
1
interactions of compression and scene characteristics. An additional goal is to test
existing and develop new objective measurements [1].
The Quality of Experience (QoE) concept for surveillance video used for recog-
nition tasks requires specific reference video and ground truth database sets for re-
search. Currently available task recognition databases (such as those prepared by
the Video Quality in Public Safety Working Group) frequently miss ground truth,
therefore they are insufficient for the research. Consequently, this paper proposes a
ground truth database. The paper describes the process of preparing test video se-
quences and related subjective scores. It also presents methods used to form visual
sequences and collections of accompanying subjective scores. The resulting data is
freely available (via a public library) to the research community [3] or at our webpage
http://vq.kt.agh.edu.pl/videoLibrary.html.
The remainder of this article is organized as follows. Section 2 describes source
video sequences. Section 2 presents Source Reference Circuits (SRC). Section 3 dis-
cusses Hypothetical Reference Circuits (HRC) related to this research. In Section 4, we
present Processed Video Sequences (PVS), while in Section 5 – we present a psycho-
physical experiment. Possible applications for state of the art research are provided in
Section 6. The paper is concluded in Section 7.
2
Figure 1: Example frame.
3
SRC
1280x720
Scale down
960x576
4
a monotonically increasing amount of information, because higher bit-rates indicate
that more information is being sent. The second advantage is that if a model predicts
the needed bit-rate for a particular detection probability, it can be used to optimize the
network utilization.
Moreover, if the network link has limited bandwidth the detection probability as a
function of a bit-rate computes the detection probability, what can be the key informa-
tion which could be crucial for a practitioner to decide whether the system is sufficient
or not.
One important conclusion is that for a bit rate as low as 180 kbit/s the detection
probability is over 80% even if the visual quality of the video is very low. Moreover,
the detection probability depends strongly on the SRC (over all detection probability
varies from 0 (sic!) to over 90%) [1, 6].
Please see [1, 4, 7, 6] for more details related to the experiment results.
5
Finally, in the paper [12], Ukhanova et al. show a related objective quality metric
that considers frame rate. The proposed metric uses PSNR, frame rate and a content-
dependent parameter that can easily be obtained from spatial and temporal activity
indexes. The results have been validated on data from a subjective quality study.
7 Conclusions
In this paper, a specific reference video database sets for research have been described.
The paper has reported the process of preparing test video sequences and related sub-
jective scores. It also has presented methods used to form visual sequences and collec-
tions of accompanying subjective scores. The resulting data is freely available (via a
public library) to the research community, which is common practice in research.
8 Acknowledgments
The authors would like to thank the European Commission’s Seventh Framework Pro-
gram (FP7/2007-2013), from which funding has been received from under grant agree-
ment №218086 (INDECT) to the research leading to these results. Preparation of
recording has been co-financed by the European Regional Development Fund under
the Innovative Economy Operational Program, INSIGMA project №POIG.01.01.02-
00-062/09.
References
[1] M. Leszczuk and J. Dumke, “Survey of recent developments in quality assessment
for target recognition video,” in Multimedia Communications, Services and
Security, ser. Communications in Computer and Information Science, A. Dziech
and A. Czyżewski, Eds. Springer Berlin Heidelberg, 2013, vol. 368, pp. 59–70.
[Online]. Available: http://dx.doi.org/10.1007/978-3-642-38559-9 6
[3] The Consumer Digital Video Library, Institute for Telecommunication Sciences,
http://www.cdvl.org/, June 2013.
[4] M. Leszczuk, L. Janowski, P. Romaniak, A. Głowacz, and R. Mirek, “Quality
assessment for a licence plate recognition task based on a video streamed in
limited networking conditions,” in Multimedia Communications, Services and
Security, ser. Communications in Computer and Information Science, A. Dziech
6
and A. Czyżewski, Eds. Springer Berlin Heidelberg, 2011, vol. 149, pp. 10–18.
[Online]. Available: http://dx.doi.org/10.1007/978-3-642-21512-4 2
[5] Video Quality, AGH University of Science and Technology, http://vq.kt.agh.edu.
pl/, June 2013.
7
(a) “Full frame + kept scale + compressed”
(c) “Cropped frame + kept scale + compressed” (d) “Cropped frame + scaled
down + compressed”
Figure 3: Example frames of four SRC versions (with relative sizes maintained).