Fix op register errors when skip cuda (#69) Fix op register errors when skip cuda, add some optimizer ops for reference backend
Fix op register errors when skip cuda (#69)
Fix op register errors when skip cuda, add some optimizer ops for reference backend
TransformerEngine-FL is a fork of TransformerEngine that introduces a plugin-based architecture for supporting diverse AI chips, built on top of FlagOS, a unified open-source AI system software stack.