<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	 xmlns:media="http://search.yahoo.com/mrss/" >

<channel>
	<title>Ekrem Cetinkaya &#8211; Bitmovin</title>
	<atom:link href="https://bitmovin.com/author/ekrem-cetinkaya/feed" rel="self" type="application/rss+xml" />
	<link>https://bitmovin.com</link>
	<description>Bitmovin provides adaptive streaming infrastructure for video publishers and integrators. Fastest cloud encoding and HTML5 Player. Play Video Anywhere.</description>
	<lastBuildDate>Mon, 09 Jan 2023 10:48:59 +0000</lastBuildDate>
	<language>en-GB</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	

<image>
	<url>https://bitmovin.com/wp-content/uploads/2023/11/bitmovin_favicon.svg</url>
	<title>Ekrem Cetinkaya &#8211; Bitmovin</title>
	<link>https://bitmovin.com</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>ATHENA Lab: Fast Multi-Resolution and Multi-Rate Encoding for HTTP Adaptive Streaming Using Machine Learning (FaRes-ML)</title>
		<link>https://bitmovin.com/multi-rate-encoding-fares-ml</link>
		
		<dc:creator><![CDATA[Christian Timmerer]]></dc:creator>
		<pubDate>Wed, 14 Jul 2021 11:49:13 +0000</pubDate>
				<category><![CDATA[Innovation]]></category>
		<category><![CDATA[athena lab]]></category>
		<category><![CDATA[video encoding]]></category>
		<guid isPermaLink="false">https://bitmovin.com/?p=179400</guid>

					<description><![CDATA[<p>The heterogeneity of the devices on the Internet and the difference among the network conditions of the users make designing a video delivery tool that can adapt to all these differences while maximizing the quality of experience (QoE) for each user a tricky problem. HTTP Adaptive Streaming (HAS) is the de-facto solution for video delivery...</p>
<p>The post <a rel="nofollow" href="https://bitmovin.com/multi-rate-encoding-fares-ml">ATHENA Lab: Fast Multi-Resolution and Multi-Rate Encoding for HTTP Adaptive Streaming Using Machine Learning (FaRes-ML)</a> appeared first on <a rel="nofollow" href="https://bitmovin.com">Bitmovin</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p><img fetchpriority="high" decoding="async" class="aligncenter size-large wp-image-179408" src="https://bitmovin.com/wp-content/uploads/2021/07/BLOG-POST_FaRes-ML-1024x537.png" alt="- Bitmovin" width="1024" height="537" srcset="https://b3148424.smushcdn.com/3148424/wp-content/uploads/2021/07/BLOG-POST_FaRes-ML-300x157.png?lossy=2&amp;strip=1&amp;webp=1 300w, https://b3148424.smushcdn.com/3148424/wp-content/uploads/2021/07/BLOG-POST_FaRes-ML.png?size=384x201&amp;lossy=2&amp;strip=1&amp;webp=1 384w, https://b3148424.smushcdn.com/3148424/wp-content/uploads/2021/07/BLOG-POST_FaRes-ML-768x402.png?lossy=2&amp;strip=1&amp;webp=1 768w, https://b3148424.smushcdn.com/3148424/wp-content/uploads/2021/07/BLOG-POST_FaRes-ML-1024x537.png?lossy=2&amp;strip=1&amp;webp=1 1024w, https://b3148424.smushcdn.com/3148424/wp-content/uploads/2021/07/BLOG-POST_FaRes-ML.png?lossy=2&amp;strip=1&amp;webp=1 1080w" sizes="(max-width: 1024px) 100vw, 1024px" /><br />
<span style="font-weight: 400;">The heterogeneity of the devices on the Internet and the difference among the network conditions of the users make designing a video delivery tool that can adapt to all these differences while maximizing the quality of experience (QoE) for each user a tricky problem. </span><a href="https://bitmovin.com/adaptive-streaming/"><span style="font-weight: 400;">HTTP Adaptive Streaming</span></a><span style="font-weight: 400;"> (HAS) is the de-facto solution for video delivery over the Internet. In HAS, multiple representations are stored for each video, with each representation having a different quality level and/or resolution. This way, HAS streaming sessions can alternate between different quality options based on the network and viewing conditions while delivering the content. However, the requirement to store multiple representations for a single video in HAS brings additional encoding challenges since the source video needs to be encoded efficiently at multiple bitrates and resolutions. Multi-Rate encoding aims to tackle this problem.&nbsp;</span><br />
<span style="font-weight: 400;">This blog post introduces our new approach to multi-rate encoding, called FaRes-ML, </span><b>Fa</b><span style="font-weight: 400;">st Multi-</span><b>Res</b><span style="font-weight: 400;">olution and Multi-Rate Encoding for HTTP Adaptive Streaming Using </span><b>M</b><span style="font-weight: 400;">achine</span><b> L</b><span style="font-weight: 400;">earning (FaRes-ML). But first&#8230;</span></p>
<h2><span style="font-weight: 400;">What is Multi-Rate Encoding?</span></h2>
<p><span style="font-weight: 400;">In multi-rate encoding, a single source video needs to be encoded at multiple bitrates and resolutions in order to provide a suitable representation for a variety of network and viewing conditions. The quality level of the encoded video is controlled by the quantization parameter (QP) in the encoder. An example multi-rate encoding scheme is given in Fig.1.</span></p>
<figure id="attachment_179405" aria-describedby="caption-attachment-179405" style="width: 800px" class="wp-caption aligncenter"><img decoding="async" class="size-full wp-image-179405" src="https://bitmovin.com/wp-content/uploads/2021/07/MultiRate.gif" alt="Multi-Rate Encoding workflow_animated gif" width="800" height="450"><figcaption id="caption-attachment-179405" class="wp-caption-text">Multi-Rate Encoding workflow</figcaption></figure>
<p><span style="font-weight: 400;">This is a computationally expensive process due to the high data size of videos and the high complexity of video codecs. However, since all of these representations consist of the same content, there is a nice amount of redundancy. Multi-rate encoding approaches exploit this redundancy to speed up the encoding process.</span><br />
<span style="font-weight: 400;">In multi-rate encoding, a representation is chosen as the </span><i><span style="font-weight: 400;">reference representation </span></i><span style="font-weight: 400;">(usually the highest [1] or the lowest quality [2] representation),</span> <span style="font-weight: 400;">and its information is used to speed up the remaining </span><i><span style="font-weight: 400;">dependent</span></i><span style="font-weight: 400;"> representations. Since block partitioning is one of the most time-consuming processes in the encoding pipeline, a majority of the multi-rate encoding approach focuses on speeding up this portion of the process.</span><br />
<span style="font-weight: 400;">In block partitioning, each frame is divided into smaller pieces called </span><b><i>blocks</i></b><span style="font-weight: 400;"> to achieve more precise motion compensation. Smaller block sizes are used for motion intense areas while larger block sizes are used for stationary areas.&nbsp;</span><br />
<a href="https://bitmovin.com/developer-network/lesson-1-9-high-efficiency-video-coding-hevc"><i><span style="font-weight: 400;">High-Efficiency Video Coding</span></i></a><span style="font-weight: 400;"> (HEVC) standard uses a Coding Tree Unit (CTU) for block partitioning. By default, each CTU covers a 64&#215;64 pixels-sized square region and each CTU can be divided recursively up to three times with the smallest block size being 8&#215;8 pixels. Each split operation increases the depth level by 1 (</span><i><span style="font-weight: 400;">i.e. </span></i><b><i>depth 0</i></b><span style="font-weight: 400;"> for </span><b>64&#215;64</b><span style="font-weight: 400;"> pixels and </span><b><i>depth 3</i></b><span style="font-weight: 400;"> for </span><b>8&#215;8 </b><span style="font-weight: 400;">pixels). An example of block partitioning for a frame is illustrated in Fig.2.</span></p>
<figure id="attachment_179410" aria-describedby="caption-attachment-179410" style="width: 800px" class="wp-caption aligncenter"><img decoding="async" class="size-full wp-image-179410" src="https://bitmovin.com/wp-content/uploads/2021/07/Block-partioning-in-Multi-rate-Encoding_animated-gif-example.gif" alt="Block partioning in Multi-rate Encoding_animated gif example" width="800" height="450"><figcaption id="caption-attachment-179410" class="wp-caption-text">Block partitioning using a CTU</figcaption></figure>
<h2><span style="font-weight: 400;">Introducing the FaRes-ML</span></h2>
<p><span style="font-weight: 400;">FaRes-ML uses </span><a href="https://towardsdatascience.com/a-comprehensive-guide-to-convolutional-neural-networks-the-eli5-way-3bd2b1164a53" rel="nofollow noopener" target="_blank"><span style="font-weight: 400;">Convolutional Neural Networks</span></a><span style="font-weight: 400;"> (CNNs) to predict the CTU split decision for the dependent representations. The highest quality representation from the lowest resolution is chosen as the reference representation. The reference representation is selected from the lowest resolution to speed up the parallel encoding performance since, in parallel encoding, the highest complexity representation bounds the overall encoding time. Thus choosing the reference from a low resolution can increase the parallel encoding performance.&nbsp;</span><br />
<span style="font-weight: 400;">The encoding process in FaRes-ML consists of three main steps:</span></p>
<ol>
<li style="font-weight: 400;" aria-level="1"><span style="font-weight: 400;">The reference representation is encoded with the HEVC reference encoder. Then, the encoding information obtained is stored to be used while encoding the dependent representations.&nbsp;</span></li>
<li style="font-weight: 400;" aria-level="1"><span style="font-weight: 400;">Once the encoding information is obtained, the pixel values from the source video in corresponding resolution and the encoding information from the reference representation are fed into the CNN for the given quality level and resolution.&nbsp;</span></li>
<li style="font-weight: 400;" aria-level="1"><span style="font-weight: 400;">The output from the CNN is the split decision for the given depth level. This decision is used to speed up the encoding of the dependent representation.</span></li>
</ol>
<p><span style="font-weight: 400;">The overall encoding scheme of FaRes-ML is given in Fig.3.</span></p>
<figure id="attachment_179406" aria-describedby="caption-attachment-179406" style="width: 800px" class="wp-caption aligncenter"><img loading="lazy" decoding="async" class="size-full wp-image-179406" src="https://bitmovin.com/wp-content/uploads/2021/07/FaRes.gif" alt="Fast Multi-rate encoding scheme_animated workflow" width="800" height="450"><figcaption id="caption-attachment-179406" class="wp-caption-text">FaRes-ML Encoding Scheme Workflow</figcaption></figure>
<p><span style="font-weight: 400;">To measure the encoding performance of the FaRes-ML approach, we compared the results to the HEVC reference software (</span><a href="https://vcgit.hhi.fraunhofer.de/jvet/HM/-/tags/HM-16.21" rel="nofollow noopener" target="_blank"><span style="font-weight: 400;">HM 16.21</span></a><span style="font-weight: 400;">) and the lower bound approach [3]. FaRes-ML achieves&nbsp; </span><b>27.71 %</b><span style="font-weight: 400;"> time saving for the </span><b>parallel</b><span style="font-weight: 400;"> encoding and </span><b>46.27%</b><span style="font-weight: 400;"> for the </span><b>overall</b><span style="font-weight: 400;"> encoding while maintaining a minimal bitrate increase (</span><b>2.05 %</b><span style="font-weight: 400;">).</span> <span style="font-weight: 400;">The resulting normalized encoding time graph is given in Fig.4.</span></p>
<figure id="attachment_179404" aria-describedby="caption-attachment-179404" style="width: 512px" class="wp-caption aligncenter"><img loading="lazy" decoding="async" class="size-full wp-image-179404" src="https://bitmovin.com/wp-content/uploads/2021/07/Fast-Multi-Rate-Encoding-efficiency-comparison_FaRes-ML-vs-Lower-Bound-vs-HEVC_Bar-Graph.jpg" alt="Fast Multi-Rate Encoding efficiency comparison_FaRes-ML vs Lower Bound vs HEVC_Bar Graph" width="512" height="288" srcset="https://b3148424.smushcdn.com/3148424/wp-content/uploads/2021/07/Fast-Multi-Rate-Encoding-efficiency-comparison_FaRes-ML-vs-Lower-Bound-vs-HEVC_Bar-Graph-300x169.png?lossy=2&amp;strip=1&amp;webp=1 300w, https://b3148424.smushcdn.com/3148424/wp-content/uploads/2021/07/Fast-Multi-Rate-Encoding-efficiency-comparison_FaRes-ML-vs-Lower-Bound-vs-HEVC_Bar-Graph.jpg?size=384x216&amp;lossy=2&amp;strip=1&amp;webp=1 384w, https://b3148424.smushcdn.com/3148424/wp-content/uploads/2021/07/Fast-Multi-Rate-Encoding-efficiency-comparison_FaRes-ML-vs-Lower-Bound-vs-HEVC_Bar-Graph.jpg?lossy=2&amp;strip=1&amp;webp=1 512w" sizes="(max-width: 512px) 100vw, 512px" /><figcaption id="caption-attachment-179404" class="wp-caption-text">Fast Multi-Rate Encoding efficiency comparison vs Lower Bound vs HEVC</figcaption></figure>
<h2><span style="font-weight: 400;">Conclusion</span></h2>
<p><span style="font-weight: 400;">As the quality of content resolution improves to new heights with 4K+ resolutions becoming the norm, organizations and researchers are finding new ways to improve the back-end delivery technologies to match the content to its respective device. One of the latest approaches to improving the speed of encoding is the FaRes-ML method, a machine learning-based approach that handles multiple representations in different qualities and resolutions. By applying CNNs to exploit the redundant information in the multi-rate encoding pipeline, FaRes-ML is capable of speeding up overall encodings by nearly 50% in ATHENA’s early-stage experiments with additional improvement parallel encoding methods, all while maintaining a minimal bitrate increase.&nbsp;</span><br />
<span style="font-weight: 400;">Although the FaRes-ML method has been proven in lab environments for single and parallel encodes, its potential can be extended to cover even more encoding decisions (e.g., reference frame selection) to further improve the encoding performance in the near future. Furthermore, the extension of the proposed method for recent video codecs such as </span><a href="https://bitmovin.com/compression-standards-vvc-2020/"><i><span style="font-weight: 400;">Versatile Video Coding</span></i><span style="font-weight: 400;"> (VVC)</span></a><span style="font-weight: 400;"> can be interesting due to the increased encoding complexity of recent video encoding standards, which would significantly decrease the amount of time organizations that operate a back-end workflow could implement the brand new codec.</span><br />
<span style="font-weight: 400;">The team at ATHENA will work closely with Bitmovin in the coming months to determine how FaRes-ML works in real-world applications. If you’re interested in learning more about the Fast Multi-Resolution and Multi-Rate Encoding approach, you can find the full study published in the </span><i><span style="font-weight: 400;">IEEE Open Journal of Signal Processing </span></i><span style="font-weight: 400;">journal as an open-access article. More information about the full study can be found in the following links:</span></p>
<ul>
<li style="font-weight: 400;" aria-level="1"><a href="https://ieeexplore.ieee.org/document/9427195" rel="nofollow noopener" target="_blank"><span style="font-weight: 400;">Full paper</span></a><span style="font-weight: 400;"> [PDF]</span></li>
<li style="font-weight: 400;" aria-level="1"><a href="https://athena.itec.aau.at/2021/05/ieee-oj-sp-fast-multi-resolution-and-multi-rate-encoding-for-http-adaptive-streaming-using-machine-learning/" rel="nofollow noopener" target="_blank"><span style="font-weight: 400;">Blog post&nbsp;</span></a></li>
<li style="font-weight: 400;" aria-level="1"><a href="https://athena.itec.aau.at/" rel="nofollow noopener" target="_blank"><span style="font-weight: 400;">More information about the ATHENA project</span></a></li>
</ul>
<p><span style="font-weight: 400;">If you liked this article, check out some of our other great ATHENA content at the following links:</span></p>
<ul>
<li style="font-weight: 400;" aria-level="1"><a href="https://bitmovin.com/scalable-light-field-coding/">Scalable Light Field Coding &#8211; Improving the Quality of Experience (QoE) | Bitmovin x ATHENA Labs</a></li>
<li aria-level="1"><a href="https://bitmovin.com/multicast-live-video-streaming-oscar/">Replacing the Multicast Live Video Streaming Approach with OSCAR | Bitmovin x ATHENA Labs</a></li>
</ul>
<h2><span style="font-weight: 400;">Sources</span></h2>
<p><span style="font-weight: 400;">[1]&nbsp;D. Schroeder, A. Ilangovan, M. Reisslein, and E. Steinbach, “Efficient multi-rate video encoding for HEVC-based adaptive HTTP streaming,” I</span><i><span style="font-weight: 400;">EEE Trans. Circuits Syst. Video Technol.</span></i><span style="font-weight: 400;">, vol. 28, no. 1, pp. 143–157, Jan. 2018.</span><br />
<span style="font-weight: 400;">[2] K. Goswami et al., “Adaptive multi-resolution encoding for ABR streaming,” in </span><i><span style="font-weight: 400;">Proc. 25th IEEE Int. Conf. Image Process.</span></i><span style="font-weight: 400;">, 2018, pp. 1008–1012.</span><br />
<span style="font-weight: 400;">[3] H. Amirpour, E. Çetinkaya, C. Timmerer, and M. Ghanbari, “Fast multi-rate encoding for adaptive HTTP streaming,” in </span><i><span style="font-weight: 400;">Proc. Data Compression Conf.</span></i><span style="font-weight: 400;">, 2020, pp. 358–358..</span></p>
<p>The post <a rel="nofollow" href="https://bitmovin.com/multi-rate-encoding-fares-ml">ATHENA Lab: Fast Multi-Resolution and Multi-Rate Encoding for HTTP Adaptive Streaming Using Machine Learning (FaRes-ML)</a> appeared first on <a rel="nofollow" href="https://bitmovin.com">Bitmovin</a>.</p>
]]></content:encoded>
					
		
		
			</item>
	</channel>
</rss>
