Zum Hauptinhalt springen

Method for mixed NAL unit type support in a coded picture

TENCENT AMERICA, LLC
2024
Online Patent

Titel:
Method for mixed NAL unit type support in a coded picture
Autor/in / Beteiligte Person: TENCENT AMERICA, LLC
Link:
Veröffentlichung: 2024
Medientyp: Patent
Sonstiges:
  • Nachgewiesen in: USPTO Patent Grants
  • Sprachen: English
  • Patent Number: 11956,442
  • Publication Date: April 09, 2024
  • Appl. No: 17/851467
  • Application Filed: June 28, 2022
  • Assignees: TENCENT AMERICA LLC (Palo Alto, CA, US)
  • Claim: 1. A method performed by at least one processor, the method comprising: receiving a first video coding layer (VCL) network abstraction layer (NAL) unit of a first slice of a coded picture and a second VCL NAL unit of a second slice of the coded picture, the first VCL NAL unit having a first VCL NAL unit type and the second VCL NAL unit having a second VCL NAL unit type that is different from the first VCL NAL unit type; and decoding the coded picture, the decoding comprising determining a picture type of the coded picture based on: a first “nal_unit_type” syntax element that indicates the first VCL NAL unit type of the first VCL NAL unit of the coded picture, and a second “nal_unit_type” syntax element that indicates the second VCL NAL unit type of the second VCL NAL unit of the coded picture, wherein the first “nal_unit_type” syntax element and the second “nal_unit_type” syntax element have different values from each other, or an indicator, received by the at least one processor, indicating that the coded picture includes VCL NAL units that respectively have “nal_unit_type” syntax elements with different values from each other.
  • Claim: 2. The method of claim 1 , wherein the determining comprises determining the coded picture is a trailing picture based on the first “nal_unit_type” syntax element indicating the first VCL NAL unit contains a trailing picture coded slice, and the second “nal_unit_type” syntax element indicating the second VCL NAL unit contains an instantaneous decoding refresh (IDR) picture coded slice or a clean random access (CRA) picture coded slice.
  • Claim: 3. The method of claim 1 , wherein the determining comprises determining the coded picture is a random access decodable leading (RADL) picture based on the first “nal_unit_type” syntax element indicating the first VCL NAL unit contains a RADL picture coded slice, and the second “nal_unit_type” syntax element indicating the second VCL NAL unit contains an instantaneous decoding refresh (IDR) picture coded slice or a clean random access (CRA) picture coded slice.
  • Claim: 4. The method of claim 1 , wherein the determining comprises determining the coded picture is a step-wise temporal sub-layer access (STSA) picture based on the first “nal_unit_type” syntax element indicating the first VCL NAL unit contains an STSA picture coded slice, and the second “nal_unit_type” syntax element indicating the second VCL NAL unit does not contain an instantaneous decoding refresh (IDR) picture coded slice.
  • Claim: 5. The method of claim 1 , wherein the determining comprises determining the coded picture is a trailing picture based on the first “nal_unit_type” syntax element indicating the first VCL NAL unit contains a step-wise temporal sub-layer access (STSA) picture coded slice, and the second “nal_unit_type” syntax element indicating the second VCL NAL unit does not contain a clean random access (CRA) picture coded slice.
  • Claim: 6. The method of claim 1 , wherein the determining comprises determining the coded picture is a trailing picture based on the first “nal_unit_type” syntax element indicating the first VCL NAL unit contains a gradual decoding refresh (GDR) picture coded slice, and the second “nal_unit_type” syntax element indicating the second VCL NAL unit contains does not contain an instantaneous decoding refresh (IDR) picture coded slice or a clean random access (CRA) picture coded slice.
  • Claim: 7. The method of claim 1 , wherein the indicator is a flag, and the determining comprises determining the coded picture is a trailing picture based on the flag indicating that the coded picture includes VCL NAL units that respectively have “nal_unit_type” syntax elements with different values from each other.
  • Claim: 8. The method of claim 1 , wherein the indicator is a flag, and the decoding the coded picture further comprises determining a temporal ID of the coded picture is 0 based on the flag indicating that the coded picture includes VCL NAL units that respectively have “nal_unit_type” syntax elements with different values from each other.
  • Claim: 9. The method of claim 1 , wherein, the indicator is a flag, and the method further comprises receiving the flag in a picture header or a slice header.
  • Claim: 10. The method of claim 1 , wherein, the indicator is a flag, and the coded picture is in a first layer, and the method further comprises: receiving the flag; and determining that an additional coded picture, that is in a second layer that is a reference layer of the first layer, includes VCL NAL units that respectively have “nal_unit_type” syntax elements with different values from each other based on the flag indicating the coded picture includes VCL NAL units that respectively have “nal_unit_type” syntax elements with different values from each other.
  • Claim: 11. A system comprising: memory configured to store computer program code; and at least one processor configured to receive at least one coded video stream, access the computer program code, and operate as instructed by the computer program code, the computer program code comprising: decoding code configured to cause the at least one processor to decode a coded picture from the at least one coded video stream, the decoding code comprising determining code configured to cause the at least one processor to determine a picture type of the coded picture based on: a first “nal_unit_type” syntax element that indicates a first video coding layer (VCL) network abstraction layer (NAL) unit type of a first VCL NAL unit of a first slice of the coded picture, and a second “nal_unit_type” syntax element that indicates a second VCL NAL unit type of a second VCL NAL unit of a second slice of the coded picture, wherein the first “nal_unit_type” syntax element and the second “nal_unit_type” syntax element have different values from each other, or an indicator, received by the at least one processor, indicating that the coded picture includes VCL NAL units that respectively have “nal_unit_type” syntax elements with different values from each other, wherein the first VCL NAL unit type is different from the second VCL NAL unit type.
  • Claim: 12. The system of claim 11 , wherein the determining code is configured to cause the at least one processor to determine the coded picture is a trailing picture based on the first “nal_unit_type” syntax element indicating the first VCL NAL unit contains a trailing picture coded slice, and the second “nal_unit_type” syntax element indicating the second VCL NAL unit contains an instantaneous decoding refresh (IDR) picture coded slice or a clean random access (CRA) picture coded slice.
  • Claim: 13. The system of claim 11 , wherein the determining code is configured to cause the at least one processor to determine the coded picture is a random access decodable leading (RADL) picture based on the first “nal_unit_type” syntax element indicating the first VCL NAL unit contains a RADL picture coded slice, and the second “nal_unit_type” syntax element indicating the second VCL NAL unit contains an instantaneous decoding refresh (IDR) picture coded slice or a clean random access (CRA) picture coded slice.
  • Claim: 14. The system of claim 11 , wherein the determining code is configured to cause the at least one processor to determine the coded picture is a step-wise temporal sub-layer access (STSA) picture based on the first “nal_unit_type” syntax element indicating the first VCL NAL unit contains an STSA picture coded slice, and the second “nal_unit_type” syntax element indicating the second VCL NAL unit does not contain an instantaneous decoding refresh (IDR) picture coded slice.
  • Claim: 15. The system of claim 11 , wherein the determining code is configured to cause the at least one processor to determine the coded picture is a trailing picture based on the first “nal_unit_type” syntax element indicating the first VCL NAL unit contains a step-wise temporal sub-layer access (STSA) picture coded slice, and the second “nal_unit_type” syntax element indicating the second VCL NAL unit does not contain a clean random access (CRA) picture coded slice.
  • Claim: 16. The system of claim 11 , wherein the determining code is configured to cause the at least one processor to determine the coded picture is a trailing picture based on the first “nal_unit_type” syntax element indicating the first VCL NAL unit contains a gradual decoding refresh (GDR) picture coded slice, and the second “nal_unit_type” syntax element indicating the second VCL NAL unit does not contain an instantaneous decoding refresh (IDR) picture coded slice or a clean random access (CRA) picture coded slice.
  • Claim: 17. The system of claim 11 , wherein the indicator is a flag, and the determining code is configured to cause the at least one processor to determine the coded picture is a trailing picture based on the flag indicating that the coded picture includes VCL NAL units that respectively have “nal_unit_type” syntax elements with different values from each other.
  • Claim: 18. The system of claim 11 , wherein the indicator is a flag, and the determining code is further configured to cause the at least one processor to determine that a temporal ID of the coded picture is 0 based on the flag indicating that the coded picture includes VCL NAL units that respectively have “nal_unit_type” syntax elements with different values from each other.
  • Claim: 19. The system of claim 11 , wherein, the indicator is a flag, and the at least one processor is configured to receive the flag in a picture header or a slice header.
  • Claim: 20. A non-transitory computer-readable medium storing computer instructions that, when executed by at least one processor, cause the at least one processor to: decode a coded picture from at least one coded video stream, the decoding comprising determining a picture type of the coded picture based on: a first “nal_unit_type” syntax element that indicates a first video coding layer (VCL) network abstraction layer (NAL) unit type of a first VCL NAL unit of a first slice of the coded picture, and a second “nal_unit_type” syntax element that indicates a second VCL NAL unit type of a second VCL NAL unit of a second slice of the coded picture, wherein the first “nal_unit_type” syntax element and the second “nal_unit_type” syntax element have different values from each other, or an indicator, received by the at least one processor, indicating that the coded picture includes VCL NAL units that respectively have “nal_unit_type” syntax elements with different values from each other, wherein the first VCL NAL unit type is different from the second VCL NAL unit type.
  • Patent References Cited: 9584804 February 2017 Wang ; 9648322 May 2017 Wang ; 10999590 May 2021 Choi ; 11032546 June 2021 Krishnan ; 11057636 July 2021 Huang ; 20140016697 January 2014 Wang ; 20140016707 January 2014 Wang ; 20150016546 January 2015 Wang et al. ; 20150085917 March 2015 Hendry et al. ; 20160234517 August 2016 Samuelsson et al. ; 20200092577 March 2020 Huang ; 20200092578 March 2020 Huang ; 20200169748 May 2020 Chen ; 20200280736 September 2020 Wang ; 20200381022 December 2020 Ilola ; 20210092359 March 2021 Nassor ; 20210092450 March 2021 Choi ; 20210105492 April 2021 Aksu ; 20210136396 May 2021 Deshpande ; 20210194946 June 2021 Hannuksela ; 20210250617 August 2021 Hannuksela ; 20210281858 September 2021 Hannuksela ; 20210297705 September 2021 Sjöberg ; 2020185922 September 2020
  • Other References: Benjamin Bross et al., “Versatile Video Coding (Draft 6)”, Joint Video Experts Team (JVET) of ITU-T SG 16 WP 3 and ISO/IEC JTC 1/SC 29/WG 11, 15th Meeting, JVET-O2001-vE, Jul. 3-12, 2019, 455 pages, Gothenburg, SE. cited by applicant ; Benjamin Bross et al., “Versatile Video Coding (Draft 7)”, Joint Video Experts Team (JVET) of ITU-T SG 16 WP 3 and ISO/IEC JTC 1/SC 29/WG 11, 16th Meeting, JVET-P2001-vE, Oct. 1-11, 2019, 492 pages, Geneva, CH. cited by applicant ; International Search Report dated Mar. 8, 2021, in International Application No. PCT/US2020/065318. cited by applicant ; Written Opinion dated Mar. 8, 2021, in International Application No. PCT/US2020/065318. cited by applicant ; Notification of reasons for refusal dated Oct. 11, 2022 from the Japanese Patent Office in Japanese Application No. 2021-559109. cited by applicant ; Robert Skupin et al., “AHG9: On mixing NAL unit types in a coded picture”, Joint Video Experts Team (JVET) of ITU-T SG 16 WP 3 and ISO/IEC JTC 1/SC 29/WG 11, Document: JVET:Q0396, 2020, pp. 1-6 (9 pages total). cited by applicant ; Miska M. Hannuksela, “AHG9: On Miixed NAL unit types in a coded picture”, Joint Video Experts Team (JVET) of ITU-T SG 16 WP 3 and ISO/IEC JTC 1/SC 29/WG 11, Document: JVET-Q0239, 2020, pp. 1-4 (6 pages total). cited by applicant ; Extended European Search Report from the European Patent Office dated Dec. 22, 2023 in Application No. 209098870. cited by applicant ; Miska M. Hannuksela, “AHG12/AHG17: On signalling of picture-specific syntax elements in access unit delimiter”, Joint Video Experts Team (JVET) of ITU-T SG 16 WP 3 and ISO/IEC JTC 1/SC 29/WG 11, JVET-P0095, 16th Meeting: Geneva, CH, Oct. 1-11, 2019, pp. 1-6 (6 pages total). cited by applicant
  • Primary Examiner: Aghevli, Reza
  • Attorney, Agent or Firm: Sughrue Mion, PLLC

Klicken Sie ein Format an und speichern Sie dann die Daten oder geben Sie eine Empfänger-Adresse ein und lassen Sie sich per Email zusenden.

oder
oder

Wählen Sie das für Sie passende Zitationsformat und kopieren Sie es dann in die Zwischenablage, lassen es sich per Mail zusenden oder speichern es als PDF-Datei.

oder
oder

Bitte prüfen Sie, ob die Zitation formal korrekt ist, bevor Sie sie in einer Arbeit verwenden. Benutzen Sie gegebenenfalls den "Exportieren"-Dialog, wenn Sie ein Literaturverwaltungsprogramm verwenden und die Zitat-Angaben selbst formatieren wollen.

xs 0 - 576
sm 576 - 768
md 768 - 992
lg 992 - 1200
xl 1200 - 1366
xxl 1366 -