`STANDARD
`
`ISO/IEC
`13818-2
`
`Second edition
`2000-12-15
`
`Information technology — Generic coding
`of moving pictures and associated audio
`information: Video
`
`Technologies deI'information — Codage générique des images animées et
`du son associé: Données vidéo
`
`SO Txe
`
`1
`
`Reference number
`ISO/IEC 13818-2:2000(E)
`
`© ISO/IEC 2000
`APPLE-1035
`
`1
`
`APPLE-1035
`
`
`
`ISO/IEC 13818-2:2000(E)
`
`PDFdisclaimer
`
`that a problem relating to it is found, please inform the Central Secretariat at the address given below.
`
`This PDFfile may contain embedded typefaces. In accordance with Adobe's licensing policy, this file may be printed or viewed but shall not
`be edited unless the typefaces which are embedded are licensed to and installed on the computer performing the editing. In downloading this
`file, parties accept therein the responsibility of not infringing Adobe's licensing policy. The ISO Central Secretariat accepts no liability in this
`area.
`
`Adobe is a trademark of Adobe Systems Incorporated.
`Details of the software products used to create this PDF file can be found in the General Info relative to the file; the PDF-creation parameters
`were optimized for printing. Every care has been taken to ensure that the file is suitable for use by ISO member bodies.In the unlikely event
`
`© ISO/IEC 2000
`
`All rights reserved. Unless otherwise specified, no part of this publication may be reproduced orutilized in any form or by any means, electronic
`or mechanical, including photocopying and microfilm, without permission in writing from either ISO at the address below or ISO's member body
`in the country of the requester.
`ISO copyright office
`Case postale 56 « CH-1211 Geneva 20
`Tel. +41 22 7490111
`Fax +41 22749 09 47
`E-mail copyright@iso.ch
`Web www.iso.ch
`Printed in Switzerland
`
`ii
`
`2
`
`© ISO/IEC 2000 — All rights reserved
`
`2
`
`
`
`ISO/IEC 13818-2:2000(E)
`
`Page
`
`Purpose...........
`Intro. 1
`
`Intro. 2 Application...
`Intro.3 Profiles and—
`Intro.4 The scalable and thenon-scalablesyntax...
`
`Normative references ........0.ccccccecccccsscceeessceesceccscccssescseatecusecesusececateesusesessaecssaseesuasecsuseseeaseeeassesuaeessaseensaseesess
`
`Definitions ..
`
`&Whe
`
`obDArerereFPooyyyemomaoaNee8.Ss.gs.3.
`
`4.2—Logical operators ............cccccceccecceccesceecceceescesecsecsceeeceacessceeesaecaeeaacsesneeaeseesaceesaesaeeuaeeaseeeseeeeaesareneteeeenes
`4.3—Relational operators ...............cccccccceeccecceseesceeeceecececseeecensceeesaecaeenacseseescaeseessasseeeaecaeeaseaseeesaeeeaeeaseneteetenes
`A.A=Bitwise operators ..........eccccceccecceccescesceeceeseececsecseeeaeesesnscaeeeaeeaesaesseseeesaeesesasseecaesaceaeesseesseeeaeeaseneteeeenes
`AS
`Assignment......cccsscssssssessssssessessssssessessssvecsssssssesessssiecssssseesesssssecsssisessetssuuecsssisevsssssuuecsssiseectesteeeseesees
`
`5.1 Methodofdescribingbitstreamsyntax...
`5.2
`Definition of functions ..
`5.3
`Reserved, forbiddenandmarker_bit.
`
`ceeeuseseseeecececeesuseseaeecscacecenseseeatecsnseecenseseeateesnseeeensaseess
`
`6.1 —Structure of coded video data 0.0.0.0... cccccccccsccssecssceessesssesseeeseeessecssesssecsssesssecssseecesseesacceasensessasenseeneess
`
`6.2
`6.3
`
`Video bitstream syntax ..............ccccececcescesceecseecececscescessceneaecaeeuaceesensusessetsacseeseesaceaeeaseseseseaeeaeeeteeeenss
`Video bitstream semantics .............ccccccccccssccssessscescecsseessesseeeseecsssessesssecsseesssecsaseeecesseessccsasesessasensseneees
`
`7
`
`The video decoding process...............
`
`seceeceaaeeseeseseeeeaeeaeeeeseteneeaeeestesseness
`
`in =
`
`7.2
`
`Variable length decoding..............c.cccccccccceesceeceeceeseeeceseeseceeeeaeceecsccseeceecaceseeessenecaeeeeteasesscseeeeesiteetessenees
`
`TA—Trverse quantisation... cccecceccesceecceceesceeceeececeeeceseeseseeceaecaetsecscecaeseeeaseaseneseeeeateaseetseseresieenseeseeness
`
`7.6 Motion compensation ..............ccccecceccceceeceeseeeeeseceseeseesecaeceeeeseeaeceeseeesaecaeesacseseeeeaeseteasenecaeseeeeateseesseness
`7.7=Spatial scalability... ..cccccccccccccecccescecceseeeeccceeeeeececeseeessneesecsecaceeeesaesaecaceaseessaeeceteaseeseseseeesieeesenatenees
`7.8
`SNR scalability ...ccccccccccscssssssesssssssssevssssssssesssssseesssssssvecsssssesscessssessssssssstessssueesessnuessetsesaresiueeseesseees
`
` OOTDDAHnnSSBSsgRxeege2
`
`8
`
`7.10 Data partitioning........ccccssssssssssessssvssssesseveessssssessssssesessesssvssessessesssessviseessisssssissseeesseeseeee
`TLL Hybrid scalability ......cccccecscsssssssssssseessessseseesssrseeesesssseesesesssnsetsssssussesssnussseitsssnsssesee
`7.12 Output of the decoding process ............ccccscceccsscceecesceeeeeeeeeeeeeescenecseeaecacesecseseneeaecetaseeesaeseeeeesenseersenees
`Profiles and levels...
`eevee
`ceceneeeeeeeeeeaenessesaeeatiaacneeeneaesaeeeeseaesessaeeeeeeaeeaeesiesateaseatseeeeeeteeteeeeteateess
`8.1
`ISO/IEC 11172.2compatibility...
`aceeeeeeeeeaeeeteaceeeeeeeseaeeeseaeeeeeeeeeaaeesneaeeesaeeseaeeseeaeeeneeeeegaeteneateeaees
`8.2
`Relationship between definedprofiles.--cvevevcvivsvivtivvvrsssee
`8.3
`Relationship between defined levels ...............ccceccccceeceeseecceseeeececeeeeceeeecceeeeseeeeceeeeaecaeeaeseseneeeeseenssenees
`8.4
`Scalablelayers...
`ceceeeecaeeeceeescecaeceseaaeeseseesaeeeseeceeeeseeeseeesenetertenes
`
`Parametervaluesfordefinedprofiles,levelsandlayers..cinvininnnnnnnaannn
`
`8.5
`
`9
`
`Registration of Copyright Identifiers ................ccccceccecceceeseeeceesceceesececenccesceneeaeeceeseeeeeceesaeesaeeaseesseeeeaesereneeestenes
`
`102
`103
`104
`106
`109
`109
`111
`111
`
`114
`
`117
`
`9.2
`
`Implementation of a Registration Authority (RA)........0..:c:ccccceccecesceeeeeseeseeeeceeeeceeeeeeceeeseeeeneeaeeeteeeens
`
`118
`
`© ISO/IEC 2000 — All rights reserved
`
`3
`
`3
`
`
`
`ISO/IEC 13818-2:2000(E)
`
`Annex A — Inverse discrete transform ...........00cccccccccccceessceessseesseceeessseesescecseseessseseseacsesuseeusecersuscecseseeesseeeeeeeeeeceeeaeeees
`
`Amnmex B . Vataable lenpti GGde talleS oe cece ececewscs seer see eesersewesereneeseceease taseeneeeaeeeeme a eeeer ene enter eeremr ene
`
`
`
`Page
`119
`
`121
`
`B.2—Macroblock type ..e.ssscsssssssssessssssesessssssseesssssseesessssssessssssevesseseseeessssunserssessiessssaniisssssesssssesaseeee 122
`
`B.3— Macroblock pattern......s..ssssssssssssssssssssssssesssssseesssssssseesssssevessessssesssssseesessssnseesessenesssssveessssssesssseee
`127
`BA;3Motiorniectors sceneseee 128
`
`
`
`Picture quallity......ccccccssssssssssssssssseesssssssesesssssseessessetietsssssestssesenessssseessssseesseseane
`D.3
`D4 Data rate control ..
`eececeeeeeeenenessacaeeaeecescneesetaesassassaesesaesetseeaeeesiasateateacseeesesaesaeeassaceeeaeeaeeeseeeeeenenetaes
`D.5
`Low delay mode.
`ceceaceacesecensusecsecuacuaessecsecaesaeeessseeseecaceasessseaeeaecaetaasaeseeesaeteeeeaseneseeseeesieeeteeeteness
`D.6 Randomaccess/channelhopping...seeeeees
`ceceeceeceeseeseseeeeaeeaeenetseteeeeeensenetenees
`
`D.7
`Scalability...
`secuesssssssevessssuseessessisecsessueesessiuietssssssessiuetssiiestiseissiesesteeeeeee
`D.8 Compatibility...
`ceceeeeeees
`ceeeenecaeeeeeeseeecsecetesecessenssereeeeeeseneeereeees
`D9 Differences betweenthis SpecificationandISO/IEC111172-2...
`secceesecseeeeeestesetaeseeeeseeseeteesteness
`D.10 Complexity...
`cecsssssustisssssstsssssusecesesuetssisiietssesessssitsseseestiseeene
`
`144
`144
`144
`145
`145
`151
`151
`154
`
`163
`Annex E — Profile and level restrictions ..
`cuceecneeecseeeeeaeeesuasaeeatsauceeeeeenecaeeaeeassesieateaseeenenesaesaseassaneeensneseeteeeatensats
`163
`E.1
`Syntax element restrictionsinprofiles...
`ceeeeeeeaeeacesaesnecaecaeesacessesessececeeseesesaeeeeteaseaeseeseresareeeeetenees
`175
`E.2
`Permissible layer combinations-.-so--scevevsvcveovveevvevvosvisrviisviiiivissisnee
`Annex F~ BADOgrapener 197
`
`Gl
`Procedure for the request ofaRegisteredIdentifier(RID)...
`ceceeeeeseesecsaeseeesesaceseeeatenetaeeeteneseatenes
`198
`G.2_
`Responsibilities ofthe Registration AUthOtityocccvccosvvvvitsuvvvtissteessteseessse
`198
`G.3
`Responsibilities ofparties requesting an RID................:cccceccesceecceceeseeeeesececeeeceeceneeaeesetaeseeeeaeseeeneteeeenes
`198
`
`200
`Annex H — Registration Application Form..
`.
`ceceeceaaeeseeseeeeeeeeeeeneteness
`200
`H.1
`Contact information oforganizationrequestingaRegisteredIdentifier(RID)ceeeeseeaeeeseeseeeetereeseeeeeaees
`200
`H.2
`Statement of an intention to apply the assigned RID................ccccccsccecceseeceeeeeeeeeceeseeseeeeseeeeneteeeneteeeeees
`200
`H.3 Date of intended implementation of the RID.................ccccccceceeceeeeeeeeseeeeeeseeeeceeeeeeeeeesceeeseeeeaeeeseneteeeeees
`
`H4=Authorized representative.... - ceseececaeeecessesnecaeeseesaeesesaeseeeeaeeceesssenecaeeaeesaeensesetenss 200
`
`
`H.5
`For official use only ofthe RegistrationAuthority...
`ceceaceaccaeeesecaceaeeaeececaeseeeaeenseseeeeenetenseeseaeess
`200
`Annex J — 4:2:2 Profile test results .....00...cccccccccccccscceeececcescecsssesesuceceesceeseacsesueseeusceseacecseseesusceeseecececseeesseesaeeeeeseeenatsees
`202
`
`Annex K — Patents...............ccccccccceeeeessssssccceeessseneeecceseeeeessnsesseeeeeseceeecessseeeececeessennneeeeeeeseseeeeseesaaeaeeeceesessssseeeceeesennnnees
`
`207
`
`iv
`
`4
`
`© ISO/IEC 2000 — All rights reserved
`
`4
`
`
`
`ISO/IEC 13818-2:2000(E)
`
`Foreword
`
`ISO (the International Organization for Standardization) and IEC (the International Electrotechnical Commission)
`form the specialized system for worldwide standardization. National bodies that are members of ISO or IEC
`participate in the development of
`International Standards through technical committees established by the
`respective organization to deal with particular fields of technical activity. ISO and IEC technical committees
`collaborate in fields of mutual interest. Other international organizations, governmental and non-governmental, in
`liaison with ISO and IEC, also take part in the work.
`
`International Standards are drafted in accordancewith the rules given in the ISO/IEC Directives, Part 3.
`
`In the field of information technology, ISO and IEC have established a joint technical committee, ISO/IEC JTC 1.
`Draft International Standards adopted by the joint technical committee are circulated to national bodies for voting.
`Publication as an International Standard requires approval by at least 75 % of the national bodies casting a vote.
`
`Attention is drawn to the possibility that some of the elements of this part of ISO/IEC 13818 may be the subject of
`patent rights. ISO and IEC shall not be held responsible for identifying any or all such patentrights.
`
`International Standard ISO/IEC 13818-2 was prepared by Joint Technical Committee ISO/IEC JTC 1, Information
`technology, Subcommittee SC 29, Coding of audio, picture, multimedia and hypermedia information,
`in
`collaboration with ITU-T. The identical text is published as ITU-T Rec. H.262.
`
`This second edition cancels and replaces the first edition (ISO/IEC 13818-2:1996), which has been technically
`revised.
`
`ISO/IEC 13818 consists of the following parts, under the generaltitle Information technology — Generic coding of
`moving pictures and associated audio information:
`
`— Part 1: Systems
`
`— Part 2: Video
`
`— Part 3: Audio
`
`— Part 4: Conformance testing
`
`— Part 5: Software simulation
`
`— Part 6: Extensions for DSM-CC
`
`— Part 7: Advanced Audio Coding (AAC)
`
`— Part 9: Extension for real time interface for systems decoders
`
`— Part 10: Conformance extensions for Digital Storage Media Command and Control (DSM-CC)
`
`Annexes A, B and C form a normative part of this part of ISO/IEC 13818. AnnexesDto K are for information only.
`
`© ISO/IEC 2000 — All rights reserved
`
`5
`
`Vv
`
`5
`
`
`
`ISO/IEC 13818-2:2000(E)
`Introduction
`
`Intro. 1
`
`Purpose
`
`This Part of this Recommendation | International Standard was developed in response to the growing need for a generic
`coding method of moving pictures and of associated sound for various applications such as digital storage media,
`television broadcasting and communication. The use of this Specification means that motion video can be manipulated as
`a form of computer data and can be stored on various storage media, transmitted and received over existing and future
`networks and distributed on existing and future broadcasting channels.
`
`Intro. 2
`
`Application
`
`The applications of this Specification cover, but are not limited to, such areas as listed below:
`
`BSS
`
`Broadcasting Satellite Service (to the home)
`
`CATV Cable TV Distribution on optical networks, copper, etc.
`
`CDAD Cable Digital Audio Distribution
`
`DSB
`
`Digital Sound Broadcasting(terrestrial and satellite broadcasting)
`
`DTTB Digital Terrestrial Television Broadcasting
`EC
`Electronic Cinema
`
`ENG
`
`FSS
`HIT
`
`IPC
`
`ISM
`
`Electronic News Gathering (including SNG,Satellite News Gathering)
`
`Fixed Satellite Service (e.g. to head ends)
`Home Television Theatre
`
`Interpersonal Communications (videoconferencing, videophone,etc.)
`
`Interactive Storage Media (optical disks, etc.)
`
`NCA
`
`News and Current Affairs
`
`NDB
`RVS
`
`Networked Database Services (via ATM, etc.)
`Remote Video Surveillance
`
`SSM
`
`Serial Storage Media (digital VTR, etc.)
`
`Intro. 3
`
`Profiles andlevels
`
`This Specification is intended to be generic in the sense that it serves a wide range of applications, bitrates, resolutions,
`qualities and services. Applications should cover, among other things, digital storage media, television broadcasting and
`communications. In the course of creating this Specification, various requirements from typical applications have been
`considered, necessary algorithmic elements have been developed, and they have been integrated into a single syntax.
`Hence,this Specification will facilitate the bitstream interchange among different applications.
`
`Considering the practicality of implementing the full syntax of this Specification, however, a limited number of subsets
`of the syntax are also stipulated by means of "profile" and "level". These and other related terms are formally defined in
`clause 3.
`
`A "profile" is a defined subset of the entire bitstream syntax that is defined by this Specification. Within the bounds
`imposed by the syntax of a given profile it is still possible to require a very large variation in the performance of
`encoders and decoders depending upon the values taken by parameters in the bitstream. For instance, it is possible to
`specify frame sizes as large as (approximately) 2!¢ samples wide by 214 lines high. It is currently neither practical nor
`economic to implement a decoder capable of dealing with all possible framesizes.
`
`In order to deal with this problem, "levels" are defined within each profile. A level is a defined set of constraints imposed
`on parameters in the bitstream. These constraints may be simple limits on numbers. Alternatively they may take the form
`of constraints on arithmetic combinations of the parameters (e.g. frame width multiplied by frame height multiplied by
`frame rate).
`
`Bitstreams complying with this Specification use a common syntax. In order to achieve a subset of the complete syntax,
`flags and parameters are included in the bitstream that signal the presence or otherwise of syntactic elements that occur
`later in the bitstream. In order to specify constraints on the syntax (and hence define a profile), it is thus only necessary to
`constrain the values of these flags and parameters that specify the presence of later syntactic elements.
`
`vi
`
`6
`
`© ISO/IEC 2000 — All rights reserved
`
`6
`
`
`
`ISO/IEC 13818-2:2000(E)
`
`Intro. 4
`
`The scalable and the non-scalable syntax
`
`The full syntax can be divided into two major categories: One is the non-scalable syntax, which is structured as a super
`set of the syntax defined in ISO/IEC 11172-2. The main feature of the non-scalable syntax is the extra compression tools
`for interlaced video signals. The second is the scalable syntax, the key property of which is to enable the reconstruction
`of useful video from pieces of a total bitstream. This is achieved by structuring the total bitstream in two or more layers,
`starting from a standalone base layer and adding a number of enhancement layers. The base layer can use the non-
`scalable syntax, or in somesituations conform to the ISO/IEC 11172-2 syntax.
`
`Intro. 4.1
`
`Overview ofthe non-scalable syntax
`
`The coded representation defined in the non-scalable syntax achieves a high compression ratio while preserving good
`image quality. The algorithm is not lossless as the exact sample values are not preserved during coding. Obtaining good
`image quality at the bitrates of interest demands very high compression, which is not achievable with intra picture coding
`alone. The need for random access, however, is best satisfied with pure intra picture coding. The choice of the techniques
`is based on the need to balance a high image quality and compression ratio with the requirement to make random access
`to the coded bitstream.
`
`A number of techniques are used to achieve high compression. The algorithm first uses block-based motion
`compensation to reduce the temporal redundancy. Motion compensation is used both for causal prediction of the current
`picture from a previous picture, and for non-causal, interpolative prediction from past and future pictures. Motion vectors
`are defined for each 16-sample by 16-line region of the picture. The prediction error, is further compressed using the
`Discrete Cosine Transform (DCT) to remove spatial correlation before it is quantised in an irreversible process that
`discards the less important information. Finally, the motion vectors are combined with the quantised DCT information,
`and encoded using variable length codes.
`
`Intro. 4.1.1.
`
`Temporalprocessing
`
`Because of the conflicting requirements of random access and highly efficient compression, three main picture types are
`defined. Intra Coded Pictures (I-Pictures) are coded without reference to other pictures. They provide access points to the
`coded sequence where decoding can begin, but are coded with only moderate compression. Predictive Coded Pictures (P-
`Pictures) are coded more efficiently using motion compensated prediction from a past intra or predictive coded picture
`and are generally used as a reference for further prediction. Bidirectionally-predictive Coded Pictures (B-Pictures)
`provide the highest degree of compression but require both past and future reference pictures for motion compensation.
`Bidirectionally-predictive coded pictures are never used as references for prediction (except in the case that the resulting
`picture is used as a reference in a spatially scalable enhancement layer). The organisation of the three picture types in a
`sequenceis very flexible. The choice is left to the encoder and will depend on the requirements of the application. Figure
`Intro. 1 illustrates an example of the relationship among the three different picture types.
`
`
`
`Figure Intro. 1 — Example of temporal picture structure
`
`© ISO/IEC 2000 — All rights reserved
`
`7
`
`7
`
`
`
`ISO/IEC 13818-2:2000(E)
`
`Intro. 4.1.2 Coding interlaced video
`
`Each frame ofinterlaced video consists of two fields which are separated by one field-period. The Specification allows
`either the frame to be encoded as picture or the two fields to be encoded as two pictures. Frame encoding or field
`encoding can be adaptively selected on a frame-by-frame basis. Frame encoding is typically preferred when the video
`scene contains significant detail with limited motion. Field encoding, in which the second field can be predicted from the
`first. works better when there is fast movement.
`
`Intro. 4.1.3 Motion representation — Macroblocks
`
`As in ISO/IEC 11172-2, the choice of 16 by 16 macroblocks for the motion-compensation unit is a result of the trade-off
`between the coding gain provided by using motion information and the overhead needed to represent it. Each macroblock
`can be temporally predicted in one of a number of different ways. For example, in frame encoding, the prediction from
`the previous reference frame can itself be either frame-based or field-based. Depending on the type of the macroblock,
`motion vector information and other side information is encoded with the compressed prediction error in each
`macroblock. The motion vectors are encoded differentially with respect to the last encoded motion vectors using variable
`length codes. The maximum length of the motion vectors that may be represented can be programmed, on a picture-by-
`picture basis, so that the most demanding applications can be met without compromising the performance of the system
`in more normal situations.
`
`It is the responsibility of the encoder to calculate appropriate motion vectors. This Specification does not specify how
`this should be done.
`
`Intro. 4.1.4
`
`Spatial redundancy reduction
`
`Both source pictures and prediction errors have high spatial redundancy. This Specification uses a block-based DCT
`method with visually weighted quantisation and run-length coding. After motion compensated prediction or
`interpolation, the resulting prediction error is split into 8 by 8 blocks. These are transformed into the DCT domain where
`they are weighted before being quantised. After quantisation many of the DCT coefficients are zero in value and so
`two-dimensional run-length and variable length coding is used to encode the remaining DCT coefficients efficiently.
`
`Intro. 4.1.5
`
` Chrominance formats
`
`In addition to the 4:2:0 format supported in ISO/IEC 11172-2 this Specification supports 4:2:2 and 4:4:4 chrominance
`formats.
`
`Intro. 4.2
`
`Scalable extensions
`
`Thescalability tools in this Specification are designed to support applications beyond that supported by single layer
`video. Among the noteworthy applications areas addressed are video telecommunications, video on Asynchronous
`Transfer Mode (ATM) networks, interworking of video standards, video service hierarchies with multiple spatial,
`temporal and quality resolutions, HDTV with embedded TV, systems allowing migration to higher temporal resolution
`HDTV, etc. Although a simple solution to scalable video is
`the simulcast
`technique which is based on
`transmission/storage of multiple independently coded reproductions of video, a more efficient alternative is scalable
`video coding, in which the bandwidth allocated to a given reproduction of video can be partially re-utilised in coding of
`the next reproduction of video. In scalable video coding, it is assumed that given a coded bitstream, decoders of various
`complexities can decode and display appropriate reproductions of coded video. A scalable video encoder is likely to have
`increased complexity when compared to a single layer encoder. However, this Recommendation | International Standard
`provides several different
`forms of scalabilities that address non-overlapping applications with corresponding
`complexities. The basic scalability tools offered are:
`
`—
`
`—
`
`—
`
`—
`
`data partitioning;
`
`SNR scalability;
`
`spatial scalability; and
`
`temporal scalability.
`
`Moreover, combinations of these basic scalability tools are also supported and are referred to as hybrid scalability. In the
`case of basic scalability, two layers of video referred to as the /Jower Jayer and the enhancement layer are allowed,
`whereas in hybrid scalability up to three layers are supported. Tables Intro.
`1 to Intro. 3 provide a few example
`applications of various scalabilities.
`
`viii
`
`8
`
`© ISO/IEC 2000 — All rights reserved
`
`8
`
`
`
`Table Intro. 1 — Applications of SNR scalability
`
`ISO/IEC 13818-2:2000(E)
`
`Recommendation Same resolution and format as|Two quality service for Standard TV (SDTV)
`
`ITU-R BT.601
`lower layer
`
`
`
`High Definition Same resolution and format as|Two quality service for HDTV
`lower layer
`
`Vide profioion/ do
`
`Table Intro. 2 — Applications of spatial scalability
`
`
`
`Table Intro. 3 — Applications of temporalscalability
`
`DTV
`
`—H
`
`HDTV
`
`Intro. 4.2.1.
`
`Spatial scalable extension
`
`Spatial scalability is a tool intended for use in video applications involving telecommunications, interworking of video
`standards, video database browsing, interworking of HDTV and TV,etc., i.e. video systems with the primary common
`feature that a minimum of two layers of spatial resolution are necessary. Spatial scalability involves generating two
`spatial resolution video layers from a single video source such that the lower layer is coded byitself to provide the basic
`spatial resolution and the enhancement layer employs the spatially interpolated lower layer and carries the full spatial
`resolution of the input video source. The lower and the enhancement layers may either both use the coding tools in this
`Specification, or the ISO/TEC 11172-2 Standard for the lower layer and this Specification for the enhancement layer. The
`latter case achieves a further advantage by facilitating interworking between video coding standards. Moreover, spatial
`scalability offers flexibility in choice of video formats to be employed in each layer. An additional advantage ofspatial
`scalability is its ability to provide resilience to transmission errors as the more important data of the lower layer can be
`sent over channel with better error performance, while the less critical enhancement layer data can be sent over a channel
`with poor error performance.
`
`Intro. 4.2.2.
`
`SNR scalable extension
`
`SNR scalability is a tool intended for use in video applications involving telecommunications, video services with
`multiple qualities, standard TV and HDTV,i.e. video systems with the primary common feature that a minimum of two
`layers of video quality are necessary. SNR scalability involves generating two video layers of same spatial resolution but
`different video qualities from a single video source such that the lower layer is coded by itself to provide the basic video
`quality and the enhancementlayer is coded to enhance the lower layer. The enhancement layer when added back to the
`
`© ISO/IEC 2000 — All rights reserved
`
`9
`
`9
`
`
`
`ISO/IEC 13818-2:2000(E)
`
`lower layer regenerates a higher quality reproduction of the input video. The lower and the enhancement layers may
`either use this Specification or ISO/IEC 11172-2 Standard for the lower layer and this Specification for the enhancement
`layer. An additional advantage of SNR scalability is its ability to provide high degree of resilience to transmission errors
`as the more important data of the lower layer can be sent over channel with better error performance, while the less
`critical enhancement layer data can be sent over a channel with poor error performance.
`
`Intro. 4.2.3. Temporal scalable extension
`
`intended for use in a range of diverse video applications from telecommunications
`Temporal scalability is a tool
`to HDTV for which migration to higher temporal resolution systems from that of lower temporal resolution systems may
`be necessary. In many cases, the lower temporal resolution video systems may be either the existing systems or the less
`expensive early generation systems, with the motivation of introducing more sophisticated systems gradually. Temporal
`scalability involves partitioning of video frames into layers, whereas the lower layer is coded by itself to provide the
`basic temporal rate and the enhancement layer is coded with temporal prediction with respect to the lower layer, these
`layers when decoded and temporal multiplexed to yield full temporal resolution of the video source. The lower temporal
`resolution systems may only decode the lower layer to provide basic temporal resolution, whereas more sophisticated
`systems of the future may decode both layers and provide high temporal resolution video while maintaining interworking
`with earlier generation systems. An additional advantage of temporal scalability is its ability to provide resilience to
`transmission errors as the more important data of the lower layer can be sent over channel with better error performance,
`while the less critical enhancement layer can be sent over a channel with poor error performance.
`
`Intro. 4.2.4 Data partitioning extension
`
`Data partitioning is a tool intended for use when two channels are available for transmission and/or storage of a
`video bitstream, as may be the case in ATM networks, terrestrial broadcast, magnetic media, etc. The bitstream is
`partitioned between these channels such that more critical parts of the bitstream (such as headers, motion vectors, low
`frequency DCT coefficients) are transmitted in the channel with the better error performance, and less critical data (such
`as higher frequency DCT coefficients) is transmitted in the channel with poor error performance. Thus, degradation to
`channel errors are minimised since the critical parts of a bitstream are better protected. Data from neither channel may be
`decoded on a decoder thatis not intended for decoding data partitioned bitstreams.
`
`x
`
`10
`
`© ISO/IEC 2000 - All rights reserved
`
`10
`
`
`
`INTERNATIONAL STANDARD
`
`ITU-T RECOMMENDATION
`
`ISO/IEC 13818-2 : 2000 (E)
`
`INFORMATION TECHNOLOGY —- GENERIC CODING OF MOVING
`PICTURES AND ASSOCIATED AUDIO INFORMATION: VIDEO
`
`1
`
`Scope
`
`This Recommendation | International Standard specifies the coded representation of picture information for digital
`storage media and digital video communication and specifies the decoding process. The representation supports constant
`bitrate transmission, variable bitrate transmission, random access, channel hopping, scalable decoding, bitstream editing,
`as well as special functions such as fast forward playback, fast reverse playback, slow motion, pause and still pictures.
`This Recommendation | International Standard is forward compatible with ISO/TEC 11172-2 and upward or downward
`compatible with EDTV, HDTV, SDTV formats.
`
`This Recommendation | International Standard is primarily applicable to digital storage media, video broadcast and
`communication. The storage media may be directly connected to the decoder, or via communications means such as
`busses, LANs, or telecommunications links.
`
`2
`
`Normative references
`
`The following Recommendations and International Standards contain provisions which, through reference in this text,
`constitute provisions of this Recommendation | International Standard. At the time of publication, the editions indicated
`were valid. All Recommendations and Standards are subject to revision, and parties to agreements based on this
`Recommendation | International Standard are encouraged to investigate the possibility of applying the most recent
`edition of the Recommendations and Standards indicated below. Members of IEC and ISO maintain registers of currently
`valid International Standards. The Telecommunication Standardization Bureau of ITU maintainsalist of currently valid
`ITU-T Recommendations.
`
`—
`
`—
`
`—
`
`—
`
`—
`
`—
`
`—
`
`—
`
`—
`
`—
`
`—
`
`—
`
`Recommendation ITU-R BT.601-3 (1992), Encoding parameters ofdigital televisionfor studios.
`
`Recommendation ITU-R BR.648 (1986), Digital recording ofaudio signals.
`
`Report ITU-R 955-2 (1990), Satellite sound broadcasting with portable receivers and receivers in
`automobiles.
`
`ISO/IEC 11172-1:1993, Information technology — Coding of moving pictures and associated audio for
`digital storage media at up to about 1,5 Mbit/s — Part 1 : Systems.
`
`ISO/IEC 11172-2:1993, Information technology — Coding of moving pictures and associated audio for
`digital storage media at up to about 1,5 Mbit/s — Part 2 : Video.
`
`ISO/IEC 11172-3:1993, Information technology — Coding of moving pictures and associated audio for
`digital storage media at up to about 1,5 Mbit/s — Part 3 : Audio.
`
`IEEE 1180:1990, Standard Specifications for the Implementations of 8 by 8 Inverse Discrete Cosine
`Transform.
`
`IEC 60908 (1999), Audio recording — Compact disc digital audio system.
`
`IEC 60461 (1986), Time and control codefor video tape recorders.
`
`ITU-T Recommendation H.261 (1993), Video codecfor audiovisual services atp X 64 kbit/s.
`
`ITU-T Recommendation H.320 (1999), Narrow-band visual telephone systems and terminal equipment.
`
`Information technology — Digital
`ISO/IEC 10918-1:1994,
`CCITT Recommendation T.81 (1992)
`compression and coding ofcontinuous-tone still images: Requirements and guidelines. (JPEG.)
`
`11
`
`ITU-T Rec. H.262 (2000 E)
`
`1
`
`11
`
`
`
`ISO/IEC 13818-2 : 2000 (E)
`
`3
`
`Definitions
`
`For the purposes of this Recommendation| International Standard, the following definitions apply.
`
`a1
`
`ACcoefficient: Any DCT coefficient for which the frequency in one or both dimensions is non-zero.
`
`big picture: A coded picture that would cause VBV buffer underflow as defined in C.7. Big pictures can only
`32
`occur in sequences where low_delay is equal to 1. "Skipped picture" is a term that is sometimes used to describe the
`same concept.
`
`33
`
`3.4
`
`B-field picture: A field structure B-Picture.
`
`B-frame picture: A frame structure B-Picture.
`
`B-picture; bidirectionally predictive-coded picture: A picture that is coded using motion compensated
`35
`prediction from past and/or future referencefields or frames.
`
`back



