news

Apple responds to using controversial YouTube resources to train AI: OpenELM model is only for research

2024-07-18

한어Русский языкEnglishFrançaisIndonesianSanskrit日本語DeutschPortuguêsΕλληνικάespañolItalianoSuomalainenLatina

IT Home reported on July 18 that Apple released a statement through technology media 9to5Mac regarding the use of controversial YouTube resources for training its OpenELM open source AI model. OpenELM is not used in any other AI or machine learning projects (including Apple Intelligence).

The nonprofit news studio ProofNews released an investigative report stating that Apple used a dataset called YouTube Subtitles, which is 5.7GB (489 million words) in size, when training the AI ​​model OpenELM.

The dataset was created by EleutherAI and was first released in 2020. It involves subtitle content of 173,536 YouTube videos from more than 48,000 channels, including subtitle content of more than 12,000 videos that have been deleted by the platform.

Apple said in its latest statement that OpenELM The purpose of the model is to contribute to the research community and promote the development of open source large-scale language models

Apple researchers have described OpenELM as a "state-of-the-art open language model."

Apple emphasizes that OpenELM is for research purposes only.Not used for any commercial Apple Intelligence features,The model is released as open source and developers can access it freely.

Apple also said that there are no plans to build a new version of the OpenELM model at this stage.