0
Views
0
CrossRef citations to date
0
Altmetric
Research Article

A fast H.266/QTMT intra coding scheme based on predictions of learned models

, , &
Received 25 Oct 2023, Accepted 11 Apr 2024, Published online: 19 Jul 2024
 

ABSTRACT

H.266/VVC adopts a QuadTree-plus-MultiType tree (QTMT) coding-unit (CU) split structure to improve efficiency at the cost of high time complexity. Speeding up VVC coding while minimizing quality degradation is critical for practical applications. We propose predicting the coding depth and split type of an optimally coded 32 × 32 CU (CU32 × 32) to perform only a subset of exhaustive rate-distortion optimization (RDO) operations: (1) To predict the depth of an optimally coded CU32 × 32, we train a convolutional neural network (CNNdepth). CNNdepth outputs a label specifying a depth range subset by which the controller can execute EarlySkip or EarlyTerminate to reduce time complexity. (2) To predict the split type, we train random forest classifiers (RFCtype). The corresponding RDO operations can be omitted if the RFCtype classifies one CU32 × 32 as not of a specified split type. Experiments show that CNNdepth and RFCtype work seamlessly, reducing execution time by up to 69% and 39.16% on average, with a 0.7% increase in BDBR compared to the default VTM-7.0. Additionally, the proposed method yields the highest balanced time reduction rate of 61.5%.

CO EDITOR-IN-CHIEF:

ASSOCIATE EDITOR:

Nomenclature

BTD/QTD=

Binary-tree/quadtree split depth

CCS=

Coding control system

CUn×n=

One CU with size n×n

CUk=

K-th Coding Unit (32 × 32)

CUlow/CUhigh=

One CU from a low/high-resolution video

CNNdepth(CUk)=

Depth prediction model for a cuk

classdepth(CUk)=

The largest BTD in a cuk

cyi=

The center feature vector of the yi-th class (EquationEquation 4)

DT=

Decision tree

ES/ET=

Early stop/early termination mode

HTT/VTT=

Horizontal/vertical ternary tree split

HBT/VBT=

Horizontal/vertical binary tree split

K=

K: dimension of a label vector (EquationEquation 4)

i=

Label of classdepth

ℓopt=

The predicted label by cnndepth (cuk)

m=

Batch size in the deep learning process. (EquationEquation 4)

mi=

The i-th classifier output of rfcmode

NO=

Not split mode

QP=

Quantizer parameter

QTMT=

Quadtree plus multitype tree

RDC=

Rate-distortion cost

RDO=

Rate-distortion optimization

RFCtype(CUk)=

The split type classification model

rfcqt=

Binary classifier to perform QT or not

ratiousefulmi=

Ratio of helpful classification of type mi

UVG266=

A VVC encoder based on the Kvazaar HEVC encoder

VTM=

VVC Test Model (reference software)

VVenC=

Fast VVC encoder implementation

Wj,W=

WjRd the j-th column vector of the weight matrix WRd×n in the last fully connected layer (EquationEquation 4)

xiRd=

The i-th input feature vector (EquationEquation 4)

yi and yiLS=

One-hot and smoothed label vectors of xi (EquationEquation 5)

ykiyi=

The k(i) is the index of correct label of xi And yki = 1 (EquationEquation 5)

∆T andT=ΔTBDBR=

Time reduction rate and its balanced one

Acknowledgment

The authors express their gratitude to the reviewers for their insightful comments and suggestions, which have greatly improved this manuscript. Special thanks to Mr. Chong-How Fan for his assistance with editing and formatting during the revision process.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

We also acknowledge the support from the Taiwan NSTC project [NSTC 112-2221-E-011-113] for funding our research.

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 199.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.