Ant Group NextEvo fully open source AI Infra technology
【數(shù)據(jù)猿導(dǎo)讀】 Ant Group NextEvo fully open source AI Infra technology

On February 1, NextEvo, the AI innovation research and development department of Ant Group, fully opened source AI Infra technology, which can help large model kcal training effective time account for more than 95%, and can achieve "automatic driving" during training, which promotes the efficiency of AI research and development. The technology framework, called DLRover, aims to make large-scale distributed training intelligent. The latest integration into DLRover is the Flash Checkpoint (FCP) scheme. During model training, it is generally necessary to Checkpoint (check point), so that when interrupted, it can be restored to the recent state. The conventional method takes a long time, the high-frequency check point is easy to reduce the training available time, and the low frequency check point is lost too much when recovering. After the training of the kilocarb parameter model, the training waste time caused by Checkpoint is reduced by about 5 times, the persistence time is reduced by about 70 times, and the effective training time is increased from 90% to 95%.
來源:DIYuan
刷新相關(guān)文章
我要評論
不容錯過的資訊
-
1董明珠怒了!獵豹移動CEO傅盛口不擇言,
-
2孫正義一路在賣,馬云暗中在買,阿里第一
-
3*ST左江涉嫌重大財務(wù)造假;Meta發(fā)布開源
-
4分拆訊飛醫(yī)療上市,科大訊飛的新故事能打
-
5深圳數(shù)據(jù)交易所迎來首家新加坡數(shù)商;科大
-
6Baichuan Intelligence released more
-
7從創(chuàng)新者到引領(lǐng)者:探索第四范式的AI之旅
-
8【金猿投融展】永洪科技——釋放數(shù)據(jù)價值
-
9市值193億!哈工大博導(dǎo)帶出一個IPO
-
10「共營」當(dāng)下,數(shù)見未來!2023第九屆GDMS
大數(shù)據(jù)企業(yè)推薦more >
大家都在搜
