-
Notifications
You must be signed in to change notification settings - Fork 74.7k
Pull requests: tensorflow/tensorflow
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
PR #28471: [XLA:GPU] allow lowering DynamicMemCopy thunk when it depends on loop iteration
#96338
opened Jul 3, 2025 by
copybara-service
bot
•
Draft
Fix XLA crash when the product of tensor dimensions overflows.
#96337
opened Jul 2, 2025 by
copybara-service
bot
•
Draft
[XLA:CPU][nanort] Store program shape in NanoRtExecutable.
#96336
opened Jul 2, 2025 by
copybara-service
bot
•
Draft
Add TraceMes to PyGrain for measuring input wait time on host and attributing device idleness to input pipeline slowness.
#96335
opened Jul 2, 2025 by
copybara-service
bot
•
Draft
[xla:cpu] Parallelize matrix-vector products
#96333
opened Jul 2, 2025 by
copybara-service
bot
•
Draft
Erros with missing .so files when running wheel test
#96332
opened Jul 2, 2025 by
copybara-service
bot
•
Draft
Integrate LLVM at llvm/llvm-project@696c0f92e0fe
#96330
opened Jul 2, 2025 by
copybara-service
bot
•
Draft
[XLA] Refactor HloDCE to improve readability.
#96328
opened Jul 2, 2025 by
copybara-service
bot
•
Draft
[XLA] Add should_inline callback to CallInliner
#96325
opened Jul 2, 2025 by
copybara-service
bot
•
Draft
Implement slinky thread pool using
Eigen::ThreadPoolInterface
. This should work, and be *reasonably* efficient, but it is not as efficient as slinky::thread_pool_impl would be directly. This is effectively an extra layer of task overhead to run each worker task (it should still better than enqueuing a new task for each iteration of a parallel loop).
#96323
opened Jul 2, 2025 by
copybara-service
bot
•
Draft
[xla:cpu] Parallelize compiled dot along the batch dimension
#96322
opened Jul 2, 2025 by
copybara-service
bot
•
Draft
Use
-Wl,-install_name
for CPU plugin on MacOS
#96321
opened Jul 2, 2025 by
copybara-service
bot
•
Draft
[TFLite] Fix incorrect
FuseBinaryOpWithTransposeConvNoneBias
optimization
#96320
opened Jul 2, 2025 by
copybara-service
bot
•
Draft
[xla:cpu] Make DotLibraryRewriter support greedy fusion mode.
#96319
opened Jul 2, 2025 by
copybara-service
bot
•
Draft
Optimize DiceSimLimitedSubgraph by pre allocating vectors.
#96316
opened Jul 2, 2025 by
copybara-service
bot
•
Draft
[HLO Diff] Optimize Ancestor LCS implementation by reducing memory allocations.
#96315
opened Jul 2, 2025 by
copybara-service
bot
•
Draft
[XLA:GPU] do not fail in nest gemm fusion if we cannot convert the computation
#96314
opened Jul 2, 2025 by
copybara-service
bot
•
Draft
[XLA:GPU] nest gemm fusion test: separate tests that are parametrized on reshape op
#96313
opened Jul 2, 2025 by
copybara-service
bot
•
Draft
[Hlo Diff] Simplify ExactSubgraphMatcher impl.
#96312
opened Jul 2, 2025 by
copybara-service
bot
•
Draft
[XLA:GPU] Add DotOperandDims::InsertDimension()
#96310
opened Jul 2, 2025 by
copybara-service
bot
•
Draft
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.