Releases: ROCm/hipTensor
Releases · ROCm/hipTensor
hipTensor 2.0.0 for ROCm 7.0.1
hipTensor code for ROCm 7.0.1 did not change. The library was rebuilt for the updated ROCm 7.0.1 stack.
hipTensor 2.0.0 for ROCm 7.0.0
Added
- Added element-wise binary operation support.
- Added element-wise trinary operation support.
- Added support for new GPU target gfx950.
- Added dynamic unary and binary operator support for element-wise operations and permutation.
- Added a CMake check for
f8
datatype availability. - Added
hiptensorDestroyOperationDescriptor
to free all resources related to the provided descriptor. - Added
hiptensorOperationDescriptorSetAttribute
to set attribute of ahiptensorOperationDescriptor_t
object. - Added
hiptensorOperationDescriptorGetAttribute
to retrieve an attribute of the providedhiptensorOperationDescriptor_t
object. - Added
hiptensorCreatePlanPreference
to allocate thehiptensorPlanPreference_t
and enabled users to limit the applicable kernels for a given plan or operation. - Added
hiptensorDestroyPlanPreference
to free all resources related to the provided preference. - Added
hiptensorPlanPreferenceSetAttribute
to set attribute of ahiptensorPlanPreference_t
object. - Added
hiptensorPlanGetAttribute
to retrieve information about an already-created plan. - Added
hiptensorEstimateWorkspaceSize
to determine the required workspaceSize for the given operation. - Added
hiptensorCreatePlan
to allocate ahiptensorPlan_t
object, select an appropriate kernel for a given operation and prepare a plan that encodes the execution. - Added
hiptensorDestroyPlan
to free all resources related to the provided plan.
Changed
- Removed architecture support for gfx940 and gfx941.
- Generalized opaque buffer now for any descriptor.
- Replaced
hipDataType
withhiptensorDataType_t
for all supported types, for example,HIP_R_32F
toHIPTENSOR_R_32F
. - Replaced
hiptensorComputeType_t
withhiptensorComputeDescriptor_t
for all supported types. - Replaced
hiptensorInitTensorDescriptor
withhiptensorCreateTensorDescriptor
. - Changed handle type and API usage from
*handle
tohandle
. - Replaced
hiptensorContractionDescriptor_t
withhipTensorOperationDescriptor_t
. - Replaced
hiptensorInitContractionDescriptor
withhiptensorCreateContraction
. - Replaced
hiptensorContractionFind_t
withhiptensorPlanPreference_t
. - Replaced
hiptensorInitContractionFind
withhiptensorCreatePlanPreference
. - Replaced
hiptensorContractionGetWorkspaceSize
withhiptensorEstimateWorkspaceSize
. - Replaced
HIPTENSOR_WORKSPACE_RECOMMENDED
withHIPTENSOR_WORKSPACE_DEFAULT
. - Replaced
hiptensorContractionPlan_t
withhiptensorPlan_t
. - Replaced
hiptensorInitContractionPlan
withhiptensorCreatePlan
. - Replaced
hiptensorContraction
withhiptensorContract
. - Replaced
hiptensorPermutation
withhiptensorPermute
. - Replaced
hiptensorReduction
withhiptensorReduce
. - Replaced
hiptensorElementwiseBinary
withhiptensorElementwiseBinaryExecute
. - Replaced
hiptensorElementwiseTrinary
withhiptensorElementwiseTrinaryExecute
. - Removed function
hiptensorReductionGetWorkspaceSize
.
hipTensor 1.5.0 for ROCm 6.4.4
hipTensor code for ROCm 6.4.4 did not change. The library was rebuilt for the updated ROCm 6.4.4 stack.
hipTensor 1.5.0 for ROCm 6.4.3
hipTensor code for ROCm 6.4.3 did not change. The library was rebuilt for the updated ROCm 6.4.3 stack.
hipTensor 1.5.0 for ROCm 6.4.2
hipTensor code for ROCm 6.4.2 did not change. The library was rebuilt for the updated ROCm 6.4.2 stack.
hipTensor 1.5.0 for ROCm 6.4.1
hipTensor code for ROCm 6.4.1 did not change. The library was rebuilt for the updated ROCm 6.4.1 stack.
rocm-6.1.5
ROCm release v6.1.5
hipTensor 1.5.0 for ROCm 6.4.0
Added
- Added benchmarking suites for contraction, permutation, and reduction. YAML files are categorized into bench and validation folders for organization
- Added emulation test suites for contraction, permutation, and reduction
- Support has been added for changing the default data layout using the
HIPTENSOR_DEFAULT_STRIDES_COL_MAJOR
environment variable
Changed
- Used
GPU_TARGETS
instead ofAMDGPU_TARGETS
incmakelists.txt
Optimized
- Optimized the hyper-parameter selection algorithm for permutation
Resolved issues
- For CMake bug workaround, set
CMAKE_NO_BUILTIN_CHRPATH
whenBUILD_OFFLOAD_COMPRESS
is unset
hipTensor 1.4.0 for ROCm 6.3.3
hipTensor code for ROCm 6.3.3 did not change. The library was rebuilt for the updated ROCm 6.3.3 stack.
hipTensor 1.4.0 for ROCm 6.3.2
hipTensor code for ROCm 6.3.2 did not change. The library was rebuilt for the updated ROCm 6.3.2 stack.