A Practical High Efficiency Video Coding Solution for Visual Sensor Network using Raspberry Pi Platform44974

2018 IEEE 12th International Symposium on Embedded Multicore/Many-core Systems-on-Chip A Practical High Efficiency Video Coding Solution for Visual Sensor Network using Raspberry Pi Platform Thao Nguyen Thi Huong1, Huy Phi Cong1, Tien Vu Huu1, Xiem HoangVan2 PTIT – Posts and Telecommunications Institute of Technology VNU – University of Engineering and Technology thaonth@ptit.edu.vn; huypc@ptit.edu.vn; tienvh@ptit.edu.vn; xiemhoang@vnu.edu.vn Abstract remote locations for further content analysis and distribution However, in a VSN, sensor nodes usually have limited processing capabilities and power budget This constrains naturally requires lightweight video signal processing and compression algorithms for individual sensor nodes At the same time, the restriction of the transmission bandwidth in a VSN also asks for an efficient video compression solution which must be used at each sensor node These two requirements are critical to achieve a practical VSN system Video coding aims to reduce the size of video data by exploiting the spatial, temporal and statistical correlation of video and the human visual system characteristics The current video coding standards, such as H.264/AVC [3] or High Efficiency Video Coding (HEVC) [4] can drastically reduce the size of transmitted video data while still guaranteeing the acceptable decoded information at the receiver HEVC is the most recent video coding standard, which provides around 50% of bitrate reduction in comparison with the widely deployed H.264/AVC standard [3] while preserving the same subjective quality However, the achievement of compression efficiency of HEVC usually associates to a large number of coding modes and selection process, i.e 35 directional intra predictions, expensive motion estimation process This may restricts the use of video compression engine in a practical VSN In this context, we present a practical, low complexity HEVC solution for visual sensor network using the common Raspberry Pi platform [5] The low complexity characteristic is achieved by using an appropriate HEVC compression profile as described later The Raspberry Pi platform is chosen as it is popular, low cost and be able to play the role of sensor nodes in a visual sensor network HEVC Test Model (HM) reference software [6] is used to provide implementation of HEVC encoder Visual sensor network (VSN) has recently emerged as a promising solution for tremendous range of new vision-sensor based applications, from video surveillance, environmental monitoring to remote sensing However, the practical VSN currently faces to the visual processing and transmitting problems due to the limitation of power at sensor nodes and the restriction of transmission bandwidth In this context, the selection of a suitable video compression algorithm is utmost important task for achieving a practical VSN To address this problem, this paper introduces a practical Raspberry Pi based High Efficiency Video Coding (HEVC) solution for visual sensor networks The selected video coding solution is one of the most up-to-date compression engines but still achieving the low complexity capability Experimental results show that the proposed video coding architecture has good compression performance with acceptable complexity performance Keywords: Visual sensor network, Raspberry Pi, HEVC Introduction Nowadays, Visual Sensor Networks (VSNs) [1, 2] plays an important role in the era of Internet of Things A VSN typically consists of a large number of sensor nodes, i.e., cameras VSNs have been successfully applied in many applications such as video surveillance and security system where a network of nodes can identify and track objects from their visual information, i.e., video Such networks are made up of multiple cameras capable of capturing visual information from their surrounding environment, performing simple processing on the captured data and transmitting the captured data to 978-1-5386-6689-0/18/$31.00 ©2018 IEEE DOI 10.1109/MCSoC2018.2018.00022 64 To achieve this objective, the rest of the paper is organized as follows Section gives a brief overview of visual sensor network and some related works Section presents the selected video compression solution and describes the Raspberry Pi platform Afterwards, Section provides the performance evaluation of the proposed video coding solution including compression performance and encoding complexity assessments Finally, Section gives some conclusions and remarks for future works HEVC The works in [9, 10] implemented traditional video coding codec such as H.264/AVC on low complexity devices Both approaches brought fantastic results with low delay under the constraints of typical low complexity devices Proposed Video Compression Platform 3.1 Proposed Video Coding based VSN The overall Raspberry Pi based low complexity HEVC architecture is illustrated in Fig 2 Related Work A VSN usually consists of tiny visual sensor nodes such as camera sensors, which integrate the image sensor, embedded processor, and wireless transceiver Fig.1 illustrates an example of a VSN in which consists of hundreds of camera nodes and a base station (BS) YUV Video sequence CSN CSN CSN CSN HEVC Video Stream Video Sensor Network HEVC Decoder Base station Fig Proposed video coding architecture In this case, the Raspberry Pi platform plays the role of sensor node in a VSN Raw video sequences are fed into a Raspberry Pi platform to be encoded This Raspberry Pi platform will produce the video bitstream using most recent HEVC standard HEVC bitstream is transmitted in VSN to base station, a higher complexity device (a computer in this case), and further processing CSN CSN HEVC Encoder Raspberry Pi module Video bit stream Base station 3.2 Raspberry Pi Platform CSN: Camera Sensor Node Raspberry Pi is an embedded platform running the Linux operating system manufactured in UK with the purpose of inspiring the teaching of basic computer science in education institute [5] In this research, the most recent Raspberry Pi model is used Fig illustrates the Raspberry Pi platform Fig An example of visual sensor networks In the VSN architecture, camera nodes capture the visual data, process and transmit valuable video information to the BS for further analysis Usually, camera nodes have small sizes and require long lifetime of battery Meanwhile, they must perform visual data processing and communicating, which are very computationally expensive, in a limited bandwidth condition Therefore, the data collected by sensor node should be compressed at each sensor node before sending to the destination However, this is not easy because traditional video codec is usually designed for broadcasting (one – to – many) applications in which the encoder is much more complex than the decode This requirement naturally is reversed with the VSN which follows a many – to – one information flow In the literature, some works focus on solutions for the communication of video data on devices with limited hardware resources For example, distributed video coding architectures were proposed in [7, 8] for low complexity VSN requirement Although experimental results showed that this is a potential direction but there is a gap in compression efficiency between the distributed video coding solution and the current video coding standards, e.g., H.264/AVC or Fig Selected raspberry pi model The Raspberry Pi features built around the Broadcom BCM2837 processor including CPU, GPU, audio/video processor and other features all integrated into this low-power chip The Raspberry Pi has a Camera Serial Interface (CSI) connector to attach a camera module directly to the Broadcom Video Core Graphics Processing Unit (GPU) using the CSI protocol Being small as a credit card, Raspberry Pi still has the capabilities of 65 working as a normal computer, it can play 1080p resolution video without lagging However, Raspberry Pi cannot completely replace a computer A disadvantage of Raspberry Pi device is that it does not support Windows operation system but it can run on Linux with utilities including web, desktop environment, and other tasks In addition, the Raspberry Pi has a low price as compared to a computer and it requires much low power which is a necessary feature in sensor networks number of intra modes, more efficient coding techniques are required for mode coding in HEVC An important feature in HEVC is fast encoding mode When the number of intra prediction modes is increased, the rate-distortion (RD) optimization process is more complex To solve this problem, HEVC introduces a fast encoding algorithm for a large set of prediction candidates Experiments performed by the official HM 6.0 reference software [6] show that fast encoding algorithm can reduce three times the encoding time with a slight coding gain reduction In other words, HM 6.0 encoder can provide a better compromise between coding efficiency and complexity 3.3 HEVC Low Complexity Profile The first version of the HEVC standard was finalized in January 2013 to fulfill emerging video resolution and quality requirements in traditional video broadcasting, tele-conferencing and mobile applications The HEVC standard still adopted the hybrid predict and transform coding architecture, which has been widely used in traditional video coding standards from H.261 [11] In HEVC, the correlation between consecutive frames is mainly exploited in Inter coding modes while the spatial correlation between samples inside each frame is exploited in Intra coding modes As reported, the HEVC Inter coding significantly outperforms the HEVC Intra coding in terms of the compression performance However, due to the large number of computations associated to the motion estimation process, the HEVC Inter coding profile may not be adopted in video applications with the low complexity requirement HEVC Intra coding contains several improvement elements when compared to the prior H.264/AVC Intra coding solution The novelties of the HEVC Intra coding [12] are specified as the following 1) Larger and flexible coding block size: The size of Coding Tree Unit (CTU) in HEVC can have up to 64×64 pixels in order to exploit better spatial correlation, especially for high definition picture, and better adaptation to different video content 2) Angular prediction with 33 prediction directions: When large block sizes are used, more prediction directions help predict accurately directional structures in video content 3) Removing intra artifact by using boundary smoothing: removing the discontinuities along block boundaries introduced by intra prediction 4) Removing intra artifact by using reference sample smoothing: depending on the block size and prediction mode to reduce the contouring artifacts 5) Block size-dependent transform selection: HEVC utilizes intra mode dependent transforms and coefficient scanning for coding the residual information 6) Intra mode coding based on contextual information: Due to the substantially increased Experimental Results 4.1 Test Methodology In order to evaluate the proposed video coding architecture, the common rate – distortion (RD) performance and the complexity performance are used [13] RD performance metric represents the relationship between the bitrate (i.e., kbps) needed and the peak-signal-to-noise ratio (PSNR) (dB) achieved For the same bitrate, the higher the PSNR, the better the quality of the frame achieved In other words, RD performance shows the quality of the encoded video sequence The second metric, complexity performance is time consuming for encoding In addition, in order to evaluate the feasibility of the proposed architecture on Raspberry Pi, results on Raspberry are compared to results on Personal Computer (PC) The basic configurations of Raspberry Pi and PC are shown in Table Table Configuration of Raspberry and PC Specification CPU type/speed RAM size OS Raspberry Pi ARM CortexA53, 1.2GHz 1GB SRAM Raspbian PC Intel Core i5, 2.6 GHz 4GB SRAM Ubuntu Fig The first frames in test sequences: RaceHorses, BasketballDrill, BQMall and PartyScene Table Characteristics of test video sequences Test sequences RaceHorses BasketballDrill BQMall PartyScene 66 Spatial resoluti on Temporal resolution 30Hz 50Hz Number of frames 300 500 60Hz 50Hz 600 500 832x480 QP 7,17,2 7,37,4 In this implementation, four common video sequences are used for assessment including RaceHorses, BasketballDrill, BQMall and PartyScene with the characteristics summarized in Table These sequences were selected for their representativeness of motion and texture characteristics Each sequence is assessed for five RD points corresponding quantization parameters (QP) 7, 17, 27, 37, 47 The first frames of each sequence are illustrated in Fig of two platforms are similar but the encoding time is different due to the configuration difference of two platforms In particularly, RD performance results for four test video sequences are presented in Table and visualized in Fig The results from Fig.5 and Table show that the quality of the decoded video is decreased when the QP value is increased However, at the middle QP value 27, the quality of video is still at high level (PSNR value is around 37dB) Therefore, the QP value 27 can be considered as the most suitable selection in term of RD performance for coding video on Raspberry Pi in visual sensor network Table RD performance of test video sequences RaceHorses BasketballDrill BQMall PartyScene QP 17 27 37 47 17 27 37 47 17 27 37 47 17 27 37 47 Bitrate (kbps) 53748.58 25379.81 9345.58 2803.44 574.37 89934.62 38533.27 12277.56 3668.84 1022.31 106343.36 45068.94 14626.96 5023.23 1392.92 118482.63 65960.88 27871.62 9228.99 1701.17 PSNR (dB) 55.13 46.06 38.80 32.19 27.05 55.14 45.58 38.49 32.82 27.57 55.20 45.49 38.98 32.90 27.10 55.39 45.49 36.69 29.34 23.38 b Complexity performance The Fig illustrates the comparison between these two platforms while Table shows the differential percentage In this case, the differential percentage is computed as Equation (1): ( ) 100% (1) = Where DP is differential percentage, ETRB and ETPC are encoding time of Raspberry and PC, respectively 1800 1600 1400 Encoding Time (s) Sequence QP = QP = 17 QP = 37 QP = 47 QP = 27 1200 1000 800 600 400 200 Fig Encoding time comparison between Raspberry Pi (RB) and Personal Computer (PC) for each test video sequence Table Differential percentage of encoding time between Raspberry and PC QP 17 27 37 47 Fig RD performance of video test sequences BasketBall 38.58 43.51 45.58 47.17 48.99 PartySene 38.62 40.57 43.67 46.26 49.55 RaceHorse 39.66 41.23 42.91 45.16 47.25 BQMall 36.91 40.22 42.94 44.15 46.99 The results show that the encoding time of Raspberry is always higher than encoding time of PC However, the encoding time difference is in proportional to the QP Therefore, in terms of encoding time, the QP 27 is also the most suitable selection for Raspberry Pi 4.2 Performance Evaluation a Compression performance In this experiment, both Raspberry Pi and PC use the same video test sequences and profile test and experimental results showed that RD performances 67 11 T Turletti, “H.261 software codec for videoconferencing over the Internet”, in Rapports de Recherche 1834, Insitut National de Recherche en Informatique et en Automatique (INRIA), SophiaAntipolis, France, Jan 1993 12 J Lainema et al., “Intra Coding of the HEVC Standard”, IEEE transactions on circuits and systems for video technology, vol.22, no.12, pp 1792-1801, Dec 2012 13 Z Kotevski and P Mitrevski, “Experimental Comparison of PSNR and SSIM Metrics for Video Quality Estimation”, In Proceedings of ICT International Conference, Innovations-09 Macedonian Society on Information and Communication Technologies, Ohrid, Macedonia, pp 357-366, 2009 In summary, the disadvantage of video codec implementation on Raspberry Pi platform is to take a higher time consuming compared to PC However, the advantage of the Raspberry Pi is the compact and low cost Therefore, if the performance of Raspberry Pi is improved in the future, it can be considered as suitable platform for video sensor network application Conclusion This paper presents a Raspberry Pi based HEVC platform for visual sensor networks The results have shown that our platform can achieve good compression ratio with moderate computational complexity even in the case of encoding high resolution video sequences and this satisfies the stated requirements for sensor nodes in visual sensor networks Our future work is performing further comprehensive assessments for different video compression algorithms on various low complexity devices such as Raspberry Pi Zero, smartphones References Y Charfi et al., “Challenging issues in visual sensor networks”, IEEE Wireless Communications, vol 16, no 2, pp 44-49, Apr 2009 S Soro and W Heinzelman, “A Survey of Visual Sensor Networks”, Advances in Multimedia, vol 2009, pp 1-21, May 2009 T Wiegand et al, ‘Overview of the H.264/AVC video coding standard’, IEEE Transactions on Circuits and Systems for Video Technology, vol 13, no 7, pp 560-576, 2003 G.J.Sullivan et al, “Overview of the high efficiency video coding (HEVC) standard”, IEEE Transactions on circuits and system for video technology, vol.22, no.12, pp 1649–1668, Dec 2012 Raspberry Pi Org [Online], Available: http://www.raspberrypi.org HM reference software, https://hevc.hhi.fraunhofer.de/svn/svn_HEVCSoftwa re/ R Puri and K Ramchandran, “PRISM: A new robust video coding architecture based on distributed compression principles,” in Proceedings of the 40th Allerton Conference Communication, Control, and Computing, Allerton, IL, pp 301-304, Oct 2002 B Girod et al., “Distributed Video Coding,” in Proceedings of the IEEE, vol 93, no 1, pp 71-83, Jan 2005 R Pereira and E Pereira, “Video Streaming: H.264 and the Internet of Things”, in Proceedings of the 2015 IEEE 29th International Conference on Advanced Information Networking and Applications Workshops (WAINA), Gwangju, Korea, pp 711–714, Mar 2015 10 U Jennehag, S Forsstrom and F.V Fiordigigli, “Low Delay Video Streaming on the Internet of Things Using Raspberry Pi” Electronics, vol 5, no 3, pp 1-11, Sep 2016 68 ... recent Raspberry Pi model is used Fig illustrates the Raspberry Pi platform Fig An example of visual sensor networks In the VSN architecture, camera nodes capture the visual data, process and transmit... advantage of the Raspberry Pi is the compact and low cost Therefore, if the performance of Raspberry Pi is improved in the future, it can be considered as suitable platform for video sensor network. .. into a Raspberry Pi platform to be encoded This Raspberry Pi platform will produce the video bitstream using most recent HEVC standard HEVC bitstream is transmitted in VSN to base station, a higher

Định dạng
Số trang	5
Dung lượng	407,75 KB