news

the wenxin model has an average daily call volume of over 700 million, and baidu is trying hard to find opportunities for its implementation

2024-09-26

한어Русский языкEnglishFrançaisIndonesianSanskrit日本語DeutschPortuguêsΕλληνικάespañolItalianoSuomalainenLatina

interface news reporter | cui peng

interface news editor | song jianan

9moon25daymorningbaiducloud intelligenceat the conferenceroll outbai geAIheterogeneous computingplatform4.0andqianfan large modelplatform3.0waitAIinfrastructure productsand announcebemultipleAIrelated businessofup to datedatain,literary mindlarge modeldaily averagecallquantityalreadyexceed7100 milliondistancebaidulast timeannouncementof6100 milliondatahavefurtherpromote

existjust endedaliyunqi conferencesuperioralibabaCEOwu yongmingeverexpressalibaba cloudofsingle networkclusteralreadyexpand to10ten thousandcardlevel,andbaidu alsounwilling tobehindshen dou, executive vice president of baidu group and president of baidu intelligent cloud business groupspecialemphasizebai ge4.0willnot onlyyeswankaclusterratherhavemature10ten thousandcardclusterdeployandmanagement capabilities

baiduthis yearalwaysemphasizelarge modeloflandingapplicationbaidu ceo robin li in a recent internal speechindicated inhaveapplication scenariocancontinuous iterationupgradeoflarge model,andothermodelthe gap between products willgetting bigger and bigger.

based on this,baidufound itchangan automobileandsamsungwaitlargecustomers stand for them,byexhibitthe application results of baidu's big model in various industries.

shen douexpressin the past year,baidufeelclientofmodelneedsurgethe required cluster size is getting larger and larger.enterpriserightmodelinference costdeclineofexpectedalsogetting higher

training large modelsthe premise iscreatecluster, this is not just about buying a gpu and assembling it.generallyneeda few monthsconductequipmentconfigurationanddebug

previously there wascloud vendorsmentioned, forming a clustercancompressionarrive1sky,andshen dousaybai ge4.0be able to dofastest1hours to completeformation,mainusewillpopular training in the industrytoolandframeconductbuilt-in way.

onceenterlarge-scale trainingstagemost importantthat isstabilitythe field of large models has always followed the famous scaling law, which believes that model performance will improve as the parameters, computing power, and data set size increase.

according to shen dou, gPUthe cluster needs to consumehugeofconstructionand operating costs, usually builtonewankaclusteroneyesGPUofpurchasecostat oncegundambillionsyuan.ifyes10ten thousandcardclusterserverone dayconsumptionofpoweraboutyes300ten thousandkwhequivalent tobeijingdongcheng districtone dayofresidential electricity consumption

existthislarge scaleofon the clusterhardwareinevitablemeetingappearfaultthe bigger the scalefaultyprobabilitythe higherexistthesefailurethe vast majorityyesdepend onGPUcaused bybecauseGPUyesvery sensitivehardwarerighttemperaturehumiditywaitenvironmentfluctuationresponsive

shen dou mentionedwhen meta trained the llama3 model, a cluster of 16,000 gpu cards was used, and a failure occurred on average once every 3 hours.

large modeltrainyeshugesingle taskonenodeerrorentireclusterjust needstop,androllbackarrivepreviousmemory pointconsideringGPUclusterofcostveryexpensiveeverymanystopone minutewillwaste of moneyeffective trainingduration"it becomesveryimportant indicators

againstlarge modeltrainin processfrequent failuresofquestionbai ge4.0rightfault detectionmeansandautomatic fault tolerance mechanismconductbeupgradeat presentwankaclustersuperiorofeffective training timeachieve99.5%shen dousaythis is higher thanpeersopponentofdataperformance.also,bai ge4.0willmainstreamlong textinference efficiencypromotebe1timesaboveat the same timereducedinference cost

in baidu's latest earnings call,robin lieverit was revealed that in the second quarter, the revenue contribution of baidu smart cloud ai further increased to 9%, compared with 6.9% in the previous quarter.

large modeltoolofperformancepromoteof courseimportantbut forbaiduto saymodel landingofresultsmore realistic

in addition to upgradingbai gein addition to the platform, baidu also introducedlatestqianfan3.0platform. according to the data released by shen dou,on the qianfan big model platform, wenxin big model is called more than 700 million times per day on average, and has helped users fine-tune 30,000 big models and develop more than 700,000 enterprise-level applications.qianfan 3.0 canit can call nearly 100 large models from home and abroad, including the wenxin series of large models, and also supports the calling of various traditional small models such as voice and vision.

at present,large modellandingofthree majormain requirementsthey areapplication developmentmodelreasoningandmodeldevelopment

existapplication development layerenterpriseRAGwillenterpriseandindustry datamakehacksknowledge basegivelarge modelandAgentagentyestwo majorcommonoflarge modellandingscenario

after receiving the task, the agent will think independently, break down the task, plan the solution, and call the tools to complete the whole process autonomously.cancomplete the pastneed3 to4indivualAPPtalentfinishedtask

shen douexpressbaiduinternalforqianfanplatformsupplybebaidu searchsearchandbaidu mapwaitexceed80indivualofficial componentsused forpromoteagentexiston specific tasksability

and inin li yanhong’s view, intelligent agents are large models.developmentalnextimportant direction."many people are optimistic about the development direction of intelligent agents, but to date, there is still no consensus on intelligent agents. there are not many companies like baidu that regard intelligent agents as the most important strategy and development direction of large models."

baidu released three products at this year's create conference, namely agentbuilder, appbuilder and modelbuilder. among them, agentbuilder and appbuilder are both related to intelligent agents, one has a lower threshold, and the other emphasizes functionality.

according to the latest data revealed by baidu, the distribution volume of intelligent bodies in the baidu ecosystem has increased significantly, with the average daily distribution times exceeding 8 million in july, which is twice the data in may.

baidudigital humanplatformandsmart customer service products are also availablelatest developmentsamong them, xiling digital human platform 4.0 supports the rapid generation of 3d digital human images and videos with different makeup and industry characteristics based on text, and reduces the price of 3d hyper-realistic digital humans from 10,000 yuan to 199 yuan.

the intelligent customer service product "keyue" has been optimized in terms of user intent understanding and multimodal information exchange, improving its ability to handle complex problems.

according to baidu, the industry“problem self-resolution rate”ofthe average level is 80%, and after the upgrade, "keyue" has raised this indicator to 92%. the product has helped corporate customers serve more than 150 million people and interacted more than 500 million times.

report/feedback