Wise Source Research Institute Launches Flag Eval "Scales" Big Model Assessment System...
DIYuan | 2023-06-09 18:34
【數(shù)據(jù)猿導(dǎo)讀】 Wise Source Research Institute Launches FlagEval "Scales" Big Model Assessment System

June 9 morning news, 2023 Beijing wisdom source conference, wisdom source research institute director Huang Tiejun announced the launch of FlagEval (scales) large language model evaluation system, aimed at "ability, task, indicators" from the three-dimensional evaluation perspective, more than 600 dimensions of the large model for a comprehensive evaluation, to establish a scientific, fair and comprehensive The system aims to establish a scientific, fair and comprehensive technical evaluation system for the Big Model. According to the introduction, the task dimension of the big model currently includes 22 subjective and objective assessment data sets, with as many as 84,433 assessment questions. Currently exploring the use of artificial intelligence technology for scientific evaluation, and strive to reduce more subjective evaluation. It is also exploring the use of large model evaluation to assist in large model pre-training.
來(lái)源:DIYuan
刷新相關(guān)文章
我要評(píng)論
不容錯(cuò)過(guò)的資訊
-
1算一筆細(xì)賬,ChatGPT、文心一言這類大模
-
2商湯與上海AI實(shí)驗(yàn)室等發(fā)布“書生·浦語(yǔ)”
-
3左手AI,右手回購(gòu),百融云創(chuàng)認(rèn)真兌現(xiàn)高成
-
4廣發(fā)證券傳媒互聯(lián)網(wǎng)首席分析師曠實(shí):大模
-
5Netease "against the water cold"
-
6全球頂尖科學(xué)家陳松蹊院士出任百分點(diǎn)數(shù)據(jù)
-
7DSM預(yù)告 | 地理時(shí)空數(shù)據(jù)專題供需對(duì)接
-
8公司老板被AI詐騙430萬(wàn);AI圖像編輯技術(shù)D
-
9馬斯克稱英偉達(dá)不會(huì)永遠(yuǎn)壟斷AI芯片市場(chǎng);
-
10華人高管加入OpenAI;富士通發(fā)布AI平臺(tái);
大數(shù)據(jù)企業(yè)推薦more >
大家都在搜
