docs/specs/MESA_shader_integer_functions.txt

b8e80941SmrgName
b8e80941Smrg
b8e80941Smrg    MESA_shader_integer_functions
b8e80941Smrg
b8e80941SmrgName Strings
b8e80941Smrg
b8e80941Smrg    GL_MESA_shader_integer_functions
b8e80941Smrg
b8e80941SmrgContact
b8e80941Smrg
b8e80941Smrg    Ian Romanick <ian.d.romanick@intel.com>
b8e80941Smrg
b8e80941SmrgContributors
b8e80941Smrg
b8e80941Smrg    All the contributors of GL_ARB_gpu_shader5
b8e80941Smrg
b8e80941SmrgStatus
b8e80941Smrg
b8e80941Smrg    Supported by all GLSL 1.30 capable drivers in Mesa 12.1 and later
b8e80941Smrg
b8e80941SmrgVersion
b8e80941Smrg
b8e80941Smrg    Version 3, March 31, 2017
b8e80941Smrg
b8e80941SmrgNumber
b8e80941Smrg
b8e80941Smrg    OpenGL Extension #495
b8e80941Smrg
b8e80941SmrgDependencies
b8e80941Smrg
b8e80941Smrg    This extension is written against the OpenGL 3.2 (Compatibility Profile)
b8e80941Smrg    Specification.
b8e80941Smrg
b8e80941Smrg    This extension is written against Version 1.50 (Revision 09) of the OpenGL
b8e80941Smrg    Shading Language Specification.
b8e80941Smrg
b8e80941Smrg    GLSL 1.30 (OpenGL) or GLSL ES 3.00 (OpenGL ES) is required.
b8e80941Smrg
b8e80941Smrg    This extension interacts with ARB_gpu_shader5.
b8e80941Smrg
b8e80941Smrg    This extension interacts with ARB_gpu_shader_fp64.
b8e80941Smrg
b8e80941Smrg    This extension interacts with NV_gpu_shader5.
b8e80941Smrg
b8e80941SmrgOverview
b8e80941Smrg
b8e80941Smrg    GL_ARB_gpu_shader5 extends GLSL in a number of useful ways.  Much of this
b8e80941Smrg    added functionality requires significant hardware support.  There are many
b8e80941Smrg    aspects, however, that can be easily implmented on any GPU with "real"
b8e80941Smrg    integer support (as opposed to simulating integers using floating point
b8e80941Smrg    calculations).
b8e80941Smrg
b8e80941Smrg    This extension provides a set of new features to the OpenGL Shading
b8e80941Smrg    Language to support capabilities of these GPUs, extending the
b8e80941Smrg    capabilities of version 1.30 of the OpenGL Shading Language and version
b8e80941Smrg    3.00 of the OpenGL ES Shading Language.  Shaders using the new
b8e80941Smrg    functionality provided by this extension should enable this
b8e80941Smrg    functionality via the construct
b8e80941Smrg
b8e80941Smrg      #extension GL_MESA_shader_integer_functions : require   (or enable)
b8e80941Smrg
b8e80941Smrg    This extension provides a variety of new features for all shader types,
b8e80941Smrg    including:
b8e80941Smrg
b8e80941Smrg      * support for implicitly converting signed integer types to unsigned
b8e80941Smrg        types, as well as more general implicit conversion and function
b8e80941Smrg        overloading infrastructure to support new data types introduced by
b8e80941Smrg        other extensions;
b8e80941Smrg
b8e80941Smrg      * new built-in functions supporting:
b8e80941Smrg
b8e80941Smrg        * splitting a floating-point number into a significand and exponent
b8e80941Smrg          (frexp), or building a floating-point number from a significand and
b8e80941Smrg          exponent (ldexp);
b8e80941Smrg
b8e80941Smrg        * integer bitfield manipulation, including functions to find the
b8e80941Smrg          position of the most or least significant set bit, count the number
b8e80941Smrg          of one bits, and bitfield insertion, extraction, and reversal;
b8e80941Smrg
b8e80941Smrg        * extended integer precision math, including add with carry, subtract
b8e80941Smrg          with borrow, and extenended multiplication;
b8e80941Smrg
b8e80941Smrg    The resulting extension is a strict subset of GL_ARB_gpu_shader5.
b8e80941Smrg
b8e80941SmrgIP Status
b8e80941Smrg
b8e80941Smrg    No known IP claims.
b8e80941Smrg
b8e80941SmrgNew Procedures and Functions
b8e80941Smrg
b8e80941Smrg    None
b8e80941Smrg
b8e80941SmrgNew Tokens
b8e80941Smrg
b8e80941Smrg    None
b8e80941Smrg
b8e80941SmrgAdditions to Chapter 2 of the OpenGL 3.2 (Compatibility Profile) Specification
b8e80941Smrg(OpenGL Operation)
b8e80941Smrg
b8e80941Smrg    None.
b8e80941Smrg
b8e80941SmrgAdditions to Chapter 3 of the OpenGL 3.2 (Compatibility Profile) Specification
b8e80941Smrg(Rasterization)
b8e80941Smrg
b8e80941Smrg    None.
b8e80941Smrg
b8e80941SmrgAdditions to Chapter 4 of the OpenGL 3.2 (Compatibility Profile) Specification
b8e80941Smrg(Per-Fragment Operations and the Frame Buffer)
b8e80941Smrg
b8e80941Smrg    None.
b8e80941Smrg
b8e80941SmrgAdditions to Chapter 5 of the OpenGL 3.2 (Compatibility Profile) Specification
b8e80941Smrg(Special Functions)
b8e80941Smrg
b8e80941Smrg    None.
b8e80941Smrg
b8e80941SmrgAdditions to Chapter 6 of the OpenGL 3.2 (Compatibility Profile) Specification
b8e80941Smrg(State and State Requests)
b8e80941Smrg
b8e80941Smrg    None.
b8e80941Smrg
b8e80941SmrgAdditions to Appendix A of the OpenGL 3.2 (Compatibility Profile)
b8e80941SmrgSpecification (Invariance)
b8e80941Smrg
b8e80941Smrg    None.
b8e80941Smrg
b8e80941SmrgAdditions to the AGL/GLX/WGL Specifications
b8e80941Smrg
b8e80941Smrg    None.
b8e80941Smrg
b8e80941SmrgModifications to The OpenGL Shading Language Specification, Version 1.50
b8e80941Smrg(Revision 09)
b8e80941Smrg
b8e80941Smrg    Including the following line in a shader can be used to control the
b8e80941Smrg    language features described in this extension:
b8e80941Smrg
b8e80941Smrg      #extension GL_MESA_shader_integer_functions : <behavior>
b8e80941Smrg
b8e80941Smrg    where <behavior> is as specified in section 3.3.
b8e80941Smrg
b8e80941Smrg    New preprocessor #defines are added to the OpenGL Shading Language:
b8e80941Smrg
b8e80941Smrg      #define GL_MESA_shader_integer_functions        1
b8e80941Smrg
b8e80941Smrg
b8e80941Smrg    Modify Section 4.1.10, Implicit Conversions, p. 27
b8e80941Smrg
b8e80941Smrg    (modify table of implicit conversions)
b8e80941Smrg
b8e80941Smrg                                Can be implicitly
b8e80941Smrg        Type of expression        converted to
b8e80941Smrg        ---------------------   -----------------
b8e80941Smrg        int                     uint, float
b8e80941Smrg        ivec2                   uvec2, vec2
b8e80941Smrg        ivec3                   uvec3, vec3
b8e80941Smrg        ivec4                   uvec4, vec4
b8e80941Smrg
b8e80941Smrg        uint                    float
b8e80941Smrg        uvec2                   vec2
b8e80941Smrg        uvec3                   vec3
b8e80941Smrg        uvec4                   vec4
b8e80941Smrg
b8e80941Smrg    (modify second paragraph of the section) No implicit conversions are
b8e80941Smrg    provided to convert from unsigned to signed integer types or from
b8e80941Smrg    floating-point to integer types.  There are no implicit array or structure
b8e80941Smrg    conversions.
b8e80941Smrg
b8e80941Smrg    (insert before the final paragraph of the section) When performing
b8e80941Smrg    implicit conversion for binary operators, there may be multiple data types
b8e80941Smrg    to which the two operands can be converted.  For example, when adding an
b8e80941Smrg    int value to a uint value, both values can be implicitly converted to uint
b8e80941Smrg    and float.  In such cases, a floating-point type is chosen if either
b8e80941Smrg    operand has a floating-point type.  Otherwise, an unsigned integer type is
b8e80941Smrg    chosen if either operand has an unsigned integer type.  Otherwise, a
b8e80941Smrg    signed integer type is chosen.
b8e80941Smrg
b8e80941Smrg
b8e80941Smrg    Modify Section 5.9, Expressions, p. 57
b8e80941Smrg
b8e80941Smrg    (modify bulleted list as follows, adding support for implicit conversion
b8e80941Smrg    between signed and unsigned types)
b8e80941Smrg
b8e80941Smrg    Expressions in the shading language are built from the following:
b8e80941Smrg
b8e80941Smrg    * Constants of type bool, int, int64_t, uint, uint64_t, float, all vector
b8e80941Smrg      types, and all matrix types.
b8e80941Smrg
b8e80941Smrg    ...
b8e80941Smrg
b8e80941Smrg    * The operator modulus (%) operates on signed or unsigned integer scalars
b8e80941Smrg      or vectors.  If the fundamental types of the operands do not match, the
b8e80941Smrg      conversions from Section 4.1.10 "Implicit Conversions" are applied to
b8e80941Smrg      produce matching types.  ...
b8e80941Smrg
b8e80941Smrg
b8e80941Smrg    Modify Section 6.1, Function Definitions, p. 63
b8e80941Smrg
b8e80941Smrg    (modify description of overloading, beginning at the top of p. 64)
b8e80941Smrg
b8e80941Smrg     Function names can be overloaded.  The same function name can be used for
b8e80941Smrg     multiple functions, as long as the parameter types differ.  If a function
b8e80941Smrg     name is declared twice with the same parameter types, then the return
b8e80941Smrg     types and all qualifiers must also match, and it is the same function
b8e80941Smrg     being declared.  For example,
b8e80941Smrg
b8e80941Smrg       vec4 f(in vec4 x, out vec4  y);   // (A)
b8e80941Smrg       vec4 f(in vec4 x, out uvec4 y);   // (B) okay, different argument type
b8e80941Smrg       vec4 f(in ivec4 x, out uvec4 y);  // (C) okay, different argument type
b8e80941Smrg
b8e80941Smrg       int  f(in vec4 x, out ivec4 y);  // error, only return type differs
b8e80941Smrg       vec4 f(in vec4 x, in  vec4  y);  // error, only qualifier differs
b8e80941Smrg       vec4 f(const in vec4 x, out vec4 y);  // error, only qualifier differs
b8e80941Smrg
b8e80941Smrg     When function calls are resolved, an exact type match for all the
b8e80941Smrg     arguments is sought.  If an exact match is found, all other functions are
b8e80941Smrg     ignored, and the exact match is used.  If no exact match is found, then
b8e80941Smrg     the implicit conversions in Section 4.1.10 (Implicit Conversions) will be
b8e80941Smrg     applied to find a match.  Mismatched types on input parameters (in or
b8e80941Smrg     inout or default) must have a conversion from the calling argument type
b8e80941Smrg     to the formal parameter type.  Mismatched types on output parameters (out
b8e80941Smrg     or inout) must have a conversion from the formal parameter type to the
b8e80941Smrg     calling argument type.
b8e80941Smrg
b8e80941Smrg     If implicit conversions can be used to find more than one matching
b8e80941Smrg     function, a single best-matching function is sought.  To determine a best
b8e80941Smrg     match, the conversions between calling argument and formal parameter
b8e80941Smrg     types are compared for each function argument and pair of matching
b8e80941Smrg     functions.  After these comparisons are performed, each pair of matching
b8e80941Smrg     functions are compared.  A function definition A is considered a better
b8e80941Smrg     match than function definition B if:
b8e80941Smrg
b8e80941Smrg       * for at least one function argument, the conversion for that argument
b8e80941Smrg         in A is better than the corresponding conversion in B; and
b8e80941Smrg
b8e80941Smrg       * there is no function argument for which the conversion in B is better
b8e80941Smrg         than the corresponding conversion in A.
b8e80941Smrg
b8e80941Smrg     If a single function definition is considered a better match than every
b8e80941Smrg     other matching function definition, it will be used.  Otherwise, a
b8e80941Smrg     semantic error occurs and the shader will fail to compile.
b8e80941Smrg
b8e80941Smrg     To determine whether the conversion for a single argument in one match is
b8e80941Smrg     better than that for another match, the following rules are applied, in
b8e80941Smrg     order:
b8e80941Smrg
b8e80941Smrg       1. An exact match is better than a match involving any implicit
b8e80941Smrg          conversion.
b8e80941Smrg
b8e80941Smrg       2. A match involving an implicit conversion from float to double is
b8e80941Smrg          better than a match involving any other implicit conversion.
b8e80941Smrg
b8e80941Smrg       3. A match involving an implicit conversion from either int or uint to
b8e80941Smrg          float is better than a match involving an implicit conversion from
b8e80941Smrg          either int or uint to double.
b8e80941Smrg
b8e80941Smrg     If none of the rules above apply to a particular pair of conversions,
b8e80941Smrg     neither conversion is considered better than the other.
b8e80941Smrg
b8e80941Smrg     For the function prototypes (A), (B), and (C) above, the following
b8e80941Smrg     examples show how the rules apply to different sets of calling argument
b8e80941Smrg     types:
b8e80941Smrg
b8e80941Smrg       f(vec4, vec4);        // exact match of vec4 f(in vec4 x, out vec4 y)
b8e80941Smrg       f(vec4, uvec4);       // exact match of vec4 f(in vec4 x, out ivec4 y)
b8e80941Smrg       f(vec4, ivec4);       // matched to vec4 f(in vec4 x, out vec4 y)
b8e80941Smrg                             //   (C) not relevant, can't convert vec4 to
b8e80941Smrg                             //   ivec4.  (A) better than (B) for 2nd
b8e80941Smrg                             //   argument (rule 2), same on first argument.
b8e80941Smrg       f(ivec4, vec4);       // NOT matched.  All three match by implicit
b8e80941Smrg                             //   conversion.  (C) is better than (A) and (B)
b8e80941Smrg                             //   on the first argument.  (A) is better than
b8e80941Smrg                             //   (B) and (C).
b8e80941Smrg
b8e80941Smrg
b8e80941Smrg    Modify Section 8.3, Common Functions, p. 84
b8e80941Smrg
b8e80941Smrg    (add support for single-precision frexp and ldexp functions)
b8e80941Smrg
b8e80941Smrg    Syntax:
b8e80941Smrg
b8e80941Smrg      genType frexp(genType x, out genIType exp);
b8e80941Smrg      genType ldexp(genType x, in genIType exp);
b8e80941Smrg
b8e80941Smrg    The function frexp() splits each single-precision floating-point number in
b8e80941Smrg    <x> into a binary significand, a floating-point number in the range [0.5,
b8e80941Smrg    1.0), and an integral exponent of two, such that:
b8e80941Smrg
b8e80941Smrg      x = significand * 2 ^ exponent
b8e80941Smrg
b8e80941Smrg    The significand is returned by the function; the exponent is returned in
b8e80941Smrg    the parameter <exp>.  For a floating-point value of zero, the significant
b8e80941Smrg    and exponent are both zero.  For a floating-point value that is an
b8e80941Smrg    infinity or is not a number, the results of frexp() are undefined.
b8e80941Smrg
b8e80941Smrg    If the input <x> is a vector, this operation is performed in a
b8e80941Smrg    component-wise manner; the value returned by the function and the value
b8e80941Smrg    written to <exp> are vectors with the same number of components as <x>.
b8e80941Smrg
b8e80941Smrg    The function ldexp() builds a single-precision floating-point number from
b8e80941Smrg    each significand component in <x> and the corresponding integral exponent
b8e80941Smrg    of two in <exp>, returning:
b8e80941Smrg
b8e80941Smrg      significand * 2 ^ exponent
b8e80941Smrg
b8e80941Smrg    If this product is too large to be represented as a single-precision
b8e80941Smrg    floating-point value, the result is considered undefined.
b8e80941Smrg
b8e80941Smrg    If the input <x> is a vector, this operation is performed in a
b8e80941Smrg    component-wise manner; the value passed in <exp> and returned by the
b8e80941Smrg    function are vectors with the same number of components as <x>.
b8e80941Smrg
b8e80941Smrg
b8e80941Smrg    (add support for new integer built-in functions)
b8e80941Smrg
b8e80941Smrg    Syntax:
b8e80941Smrg
b8e80941Smrg      genIType bitfieldExtract(genIType value, int offset, int bits);
b8e80941Smrg      genUType bitfieldExtract(genUType value, int offset, int bits);
b8e80941Smrg
b8e80941Smrg      genIType bitfieldInsert(genIType base, genIType insert, int offset,
b8e80941Smrg                              int bits);
b8e80941Smrg      genUType bitfieldInsert(genUType base, genUType insert, int offset,
b8e80941Smrg                              int bits);
b8e80941Smrg
b8e80941Smrg      genIType bitfieldReverse(genIType value);
b8e80941Smrg      genUType bitfieldReverse(genUType value);
b8e80941Smrg
b8e80941Smrg      genIType bitCount(genIType value);
b8e80941Smrg      genIType bitCount(genUType value);
b8e80941Smrg
b8e80941Smrg      genIType findLSB(genIType value);
b8e80941Smrg      genIType findLSB(genUType value);
b8e80941Smrg
b8e80941Smrg      genIType findMSB(genIType value);
b8e80941Smrg      genIType findMSB(genUType value);
b8e80941Smrg
b8e80941Smrg    The function bitfieldExtract() extracts bits <offset> through
b8e80941Smrg    <offset>+<bits>-1 from each component in <value>, returning them in the
b8e80941Smrg    least significant bits of corresponding component of the result.  For
b8e80941Smrg    unsigned data types, the most significant bits of the result will be set
b8e80941Smrg    to zero.  For signed data types, the most significant bits will be set to
b8e80941Smrg    the value of bit <offset>+<base>-1.  If <bits> is zero, the result will be
b8e80941Smrg    zero.  The result will be undefined if <offset> or <bits> is negative, or
b8e80941Smrg    if the sum of <offset> and <bits> is greater than the number of bits used
b8e80941Smrg    to store the operand.  Note that for vector versions of bitfieldExtract(),
b8e80941Smrg    a single pair of <offset> and <bits> values is shared for all components.
b8e80941Smrg
b8e80941Smrg    The function bitfieldInsert() inserts the <bits> least significant bits of
b8e80941Smrg    each component of <insert> into the corresponding component of <base>.
b8e80941Smrg    The result will have bits numbered <offset> through <offset>+<bits>-1
b8e80941Smrg    taken from bits 0 through <bits>-1 of <insert>, and all other bits taken
b8e80941Smrg    directly from the corresponding bits of <base>.  If <bits> is zero, the
b8e80941Smrg    result will simply be <base>.  The result will be undefined if <offset> or
b8e80941Smrg    <bits> is negative, or if the sum of <offset> and <bits> is greater than
b8e80941Smrg    the number of bits used to store the operand.  Note that for vector
b8e80941Smrg    versions of bitfieldInsert(), a single pair of <offset> and <bits> values
b8e80941Smrg    is shared for all components.
b8e80941Smrg
b8e80941Smrg    The function bitfieldReverse() reverses the bits of <value>.  The bit
b8e80941Smrg    numbered <n> of the result will be taken from bit (<bits>-1)-<n> of
b8e80941Smrg    <value>, where <bits> is the total number of bits used to represent
b8e80941Smrg    <value>.
b8e80941Smrg
b8e80941Smrg    The function bitCount() returns the number of one bits in the binary
b8e80941Smrg    representation of <value>.
b8e80941Smrg
b8e80941Smrg    The function findLSB() returns the bit number of the least significant one
b8e80941Smrg    bit in the binary representation of <value>.  If <value> is zero, -1 will
b8e80941Smrg    be returned.
b8e80941Smrg
b8e80941Smrg    The function findMSB() returns the bit number of the most significant bit
b8e80941Smrg    in the binary representation of <value>.  For positive integers, the
b8e80941Smrg    result will be the bit number of the most significant one bit.  For
b8e80941Smrg    negative integers, the result will be the bit number of the most
b8e80941Smrg    significant zero bit.  For a <value> of zero or negative one, -1 will be
b8e80941Smrg    returned.
b8e80941Smrg
b8e80941Smrg
b8e80941Smrg    (support for unsigned integer add/subtract with carry-out)
b8e80941Smrg
b8e80941Smrg    Syntax:
b8e80941Smrg
b8e80941Smrg      genUType uaddCarry(genUType x, genUType y, out genUType carry);
b8e80941Smrg      genUType usubBorrow(genUType x, genUType y, out genUType borrow);
b8e80941Smrg
b8e80941Smrg    The function uaddCarry() adds 32-bit unsigned integers or vectors <x> and
b8e80941Smrg    <y>, returning the sum modulo 2^32.  The value <carry> is set to zero if
b8e80941Smrg    the sum was less than 2^32, or one otherwise.
b8e80941Smrg
b8e80941Smrg    The function usubBorrow() subtracts the 32-bit unsigned integer or vector
b8e80941Smrg    <y> from <x>, returning the difference if non-negative or 2^32 plus the
b8e80941Smrg    difference, otherwise.  The value <borrow> is set to zero if x >= y, or
b8e80941Smrg    one otherwise.
b8e80941Smrg
b8e80941Smrg
b8e80941Smrg    (support for signed and unsigned multiplies, with 32-bit inputs and a
b8e80941Smrg     64-bit result spanning two 32-bit outputs)
b8e80941Smrg
b8e80941Smrg    Syntax:
b8e80941Smrg
b8e80941Smrg      void umulExtended(genUType x, genUType y, out genUType msb,
b8e80941Smrg                        out genUType lsb);
b8e80941Smrg      void imulExtended(genIType x, genIType y, out genIType msb,
b8e80941Smrg                        out genIType lsb);
b8e80941Smrg
b8e80941Smrg    The functions umulExtended() and imulExtended() multiply 32-bit unsigned
b8e80941Smrg    or signed integers or vectors <x> and <y>, producing a 64-bit result.  The
b8e80941Smrg    32 least significant bits are returned in <lsb>; the 32 most significant
b8e80941Smrg    bits are returned in <msb>.
b8e80941Smrg
b8e80941Smrg
b8e80941SmrgGLX Protocol
b8e80941Smrg
b8e80941Smrg    None.
b8e80941Smrg
b8e80941SmrgDependencies on ARB_gpu_shader_fp64
b8e80941Smrg
b8e80941Smrg    This extension, ARB_gpu_shader_fp64, and NV_gpu_shader5 all modify the set
b8e80941Smrg    of implicit conversions supported in the OpenGL Shading Language.  If more
b8e80941Smrg    than one of these extensions is supported, an expression of one type may
b8e80941Smrg    be converted to another type if that conversion is allowed by any of these
b8e80941Smrg    specifications.
b8e80941Smrg
b8e80941Smrg    If ARB_gpu_shader_fp64 or a similar extension introducing new data types
b8e80941Smrg    is not supported, the function overloading rule in the GLSL specification
b8e80941Smrg    preferring promotion an input parameters to smaller type to a larger type
b8e80941Smrg    is never applicable, as all data types are of the same size.  That rule
b8e80941Smrg    and the example referring to "double" should be removed.
b8e80941Smrg
b8e80941Smrg
b8e80941SmrgDependencies on NV_gpu_shader5
b8e80941Smrg
b8e80941Smrg    This extension, ARB_gpu_shader_fp64, and NV_gpu_shader5 all modify the set
b8e80941Smrg    of implicit conversions supported in the OpenGL Shading Language.  If more
b8e80941Smrg    than one of these extensions is supported, an expression of one type may
b8e80941Smrg    be converted to another type if that conversion is allowed by any of these
b8e80941Smrg    specifications.
b8e80941Smrg
b8e80941Smrg    If NV_gpu_shader5 is supported, integer data types are supported with four
b8e80941Smrg    different precisions (8-, 16, 32-, and 64-bit) and floating-point data
b8e80941Smrg    types are supported with three different precisions (16-, 32-, and
b8e80941Smrg    64-bit).  The extension adds the following rule for output parameters,
b8e80941Smrg    which is similar to the one present in this extension for input
b8e80941Smrg    parameters:
b8e80941Smrg
b8e80941Smrg       5. If the formal parameters in both matches are output parameters, a
b8e80941Smrg          conversion from a type with a larger number of bits per component is
b8e80941Smrg          better than a conversion from a type with a smaller number of bits
b8e80941Smrg          per component.  For example, a conversion from an "int16_t" formal
b8e80941Smrg          parameter type to "int"  is better than one from an "int8_t" formal
b8e80941Smrg          parameter type to "int".
b8e80941Smrg
b8e80941Smrg    Such a rule is not provided in this extension because there is no
b8e80941Smrg    combination of types in this extension and ARB_gpu_shader_fp64 where this
b8e80941Smrg    rule has any effect.
b8e80941Smrg
b8e80941Smrg
b8e80941SmrgErrors
b8e80941Smrg
b8e80941Smrg    None
b8e80941Smrg
b8e80941Smrg
b8e80941SmrgNew State
b8e80941Smrg
b8e80941Smrg    None
b8e80941Smrg
b8e80941SmrgNew Implementation Dependent State
b8e80941Smrg
b8e80941Smrg    None
b8e80941Smrg
b8e80941SmrgIssues
b8e80941Smrg
b8e80941Smrg    (1) What should this extension be called?
b8e80941Smrg
b8e80941Smrg      UNRESOLVED.  This extension borrows from GL_ARB_gpu_shader5, so creating
b8e80941Smrg      some sort of a play on that name would be viable.  However, nothing in
b8e80941Smrg      this extension should require SM5 hardware, so such a name would be a
b8e80941Smrg      little misleading and weird.
b8e80941Smrg
b8e80941Smrg      Since the primary purpose is to add integer related functions from
b8e80941Smrg      GL_ARB_gpu_shader5, call this extension GL_MESA_shader_integer_functions
b8e80941Smrg      for now.
b8e80941Smrg
b8e80941Smrg    (2) Why is some of the formatting in this extension weird?
b8e80941Smrg
b8e80941Smrg      RESOLVED: This extension is formatted to minimize the differences (as
b8e80941Smrg      reported by 'diff --side-by-side -W180') with the GL_ARB_gpu_shader5
b8e80941Smrg      specification.
b8e80941Smrg
b8e80941Smrg    (3) Should ldexp and frexp be included?
b8e80941Smrg
b8e80941Smrg      RESOLVED: Yes.  Few GPUs have native instructions to implement these
b8e80941Smrg      functions.  These are generally implemented using existing GLSL built-in
b8e80941Smrg      functions and the other functions provided by this extension.
b8e80941Smrg
b8e80941Smrg    (4) Should umulExtended and imulExtended be included?
b8e80941Smrg
b8e80941Smrg      RESOLVED: Yes.  These functions should be implementable on any GPU that
b8e80941Smrg      can support the rest of this extension, but the implementation may be
b8e80941Smrg      complex.  The implementation on a GPU that only supports 32bit x 32bit =
b8e80941Smrg      32bit multiplication would be quite expensive.  However, many GPUs
b8e80941Smrg      (including OpenGL 4.0 GPUs that already support this function) have a
b8e80941Smrg      32bit x 16bit = 48bit multiplier.  The implementation there is only
b8e80941Smrg      trivially more expensive than regular 32bit multiplication.
b8e80941Smrg
b8e80941Smrg    (5) Should the pack and unpack functions be included?
b8e80941Smrg
b8e80941Smrg      RESOLVED: No.  These functions are already available via
b8e80941Smrg      GL_ARB_shading_language_packing.
b8e80941Smrg
b8e80941Smrg    (6) Should the "BitsTo" functions be included?
b8e80941Smrg
b8e80941Smrg      RESOLVED: No.  These functions are already available via
b8e80941Smrg      GL_ARB_shader_bit_encoding.
b8e80941Smrg
b8e80941SmrgRevision History
b8e80941Smrg
b8e80941Smrg    Rev.      Date     Author    Changes
b8e80941Smrg    ----  -----------  --------  -----------------------------------------
b8e80941Smrg     3    31-Mar-2017  Jon Leech Add ES support (OpenGL-Registry/issues/3)
b8e80941Smrg     2     7-Jul-2016  idr       Fix typo in #extension line
b8e80941Smrg     1    20-Jun-2016  idr       Initial version based on GL_ARB_gpu_shader5.