# | Article Title | News Link | Source Image | When Posted | Writer | Article Thumb |
---|---|---|---|---|---|---|
1 | 'Humanity's Last Exam' benchmark is stumping top AI models - can you do any better? | https://news.google.com/read/CBMirAFBVV95cUxNZXhVQVdZVG9KeHU5aHZrVEw2aDlOaDZTd3BGQjVCUmtyNVpPNmU5TW1oZGlNZk9WVDdGMDF3MGladWNHN3VDNkRrU28tN2pfS281dVVmblRSNTNEZGUyeU96ZnhwWUJDMnpnYVF6WUx3NXdOblh4ZVEta1dlUEl6WTZQaHd3Vk53RVItN3FOWXE5bEpGQW5OMllkQWZnOG13cjh4eEwtQ3lBV2Vs?hl=en-CA&gl=CA&ceid=CA%3Aen | https://encrypted-tbn3.gstatic.com/faviconV2?url=https://www.zdnet.com&client=NEWS_360&size=96&type=FAVICON&fallback_opts=TYPE,SIZE,URL | Jan 27 | By Radhika Rajkumar | https://news.google.com/api/attachments/CC8iK0NnNUViRWxEYTJ0RVlVeFFUbWcyVFJDZkF4ampCU2dLTWdZNU00b3lJUWs=-w200-h112-p-df-rw |
2 | Can AI really compete with human data scientists? OpenAI’s new benchmark puts it to the test | https://news.google.com/read/CBMiuAFBVV95cUxNaUJJV1NENEF6YTRIS0VhVS05MGIteTJ0RkZubWdWRmVnMkVhaUJHeWFfZmJVOFZvVGF1WXUtdkttUnNZQkxaemV6TDFMMkdENWQ3NzdMM1E2N3daNVFDSTVPNVhnT2NDeGJtMjl0UnhNTjB1NkwyeGxvOE04RDhpc0o1Zk5RWTFGb2UtNUlxQ3RYMlRicDk4VVRyS29INkZ4ek85UktzMk0yQW5rWFNPbEtIZkRvWGRz?hl=en-CA&gl=CA&ceid=CA%3Aen | https://encrypted-tbn3.gstatic.com/faviconV2?url=https://venturebeat.com&client=NEWS_360&size=96&type=FAVICON&fallback_opts=TYPE,SIZE,URL | Oct 10 | By Michael Nunez | https://news.google.com/api/attachments/CC8iK0NnNTJaMjVOVFZVNVJscHZRbUZmVFJEZ0FSaVFBeWdLTWdZQklZelZGQW8=-w200-h112-p-df-rw |
3 | An AI system has reached human level on a test for ‘general intelligence’. Here’s what that means | https://news.google.com/read/CBMixwFBVV95cUxObVdDdllnVjNqbVJKaWtrLWpfLVJPdkx6a1JmRnRFOUtROGlxUzN0OWNoYnl3bXBka2lhcnk0Y3BKWjUtTFlOS2dzQlhlWnZrY1pLdnduNDItNDUzcjRtZ2QtQUNaSGdUSEhJQXhmSkpDTmp5d1QxcEdXUHVTcmhyb3R2djRXRWZPS2YzLTAtRXZ5T3pVVU9NRUxzNzBsZWtUOVpGVFFNTEIxajBuZHNTTFZCaUNRTmQ3X0NsQlM5TVE1ME9mdFI4?hl=en-CA&gl=CA&ceid=CA%3Aen | https://encrypted-tbn0.gstatic.com/faviconV2?url=https://theconversation.com&client=NEWS_360&size=96&type=FAVICON&fallback_opts=TYPE,SIZE,URL | Dec 24 | By Michael Timothy Bennett & Elija Perrier | https://news.google.com/api/attachments/CC8iK0NnNTVkWHBOWVVwMFdEVkNOVFI2VFJDcUF4aUFCU2dLTWdZQndJYW54QU0=-w200-h112-p-df-rw |
4 | Researchers just stumped AI with their most difficult test — but for how long? | https://news.google.com/read/CBMihgFBVV95cUxNVjA3Zk5UWjdTNDk3ZHVWd1YxbHdjMTcxVmpCblc4SGQ4N1lLMGxRejBRTHg2aHF2VGJhNUY1N2NWajFRSFhMYWZHWkhVSG5zS0p3dEZyMm5rS0gxM2hRM0ppejAtWkxPRFVTN1MzMGlkNGhPNVBkN00xUkdXTVMzR2ZCZG1IZw?hl=en-CA&gl=CA&ceid=CA%3Aen | https://encrypted-tbn2.gstatic.com/faviconV2?url=https://qz.com&client=NEWS_360&size=96&type=FAVICON&fallback_opts=TYPE,SIZE,URL | Jan 23 | By Britney Nguyen | https://news.google.com/api/attachments/CC8iK0NnNDBiV3cxTFZkbUxTMXVNRWhGVFJDZkF4ampCU2dLTWdZZFZZek1yUVU=-w200-h112-p-df-rw |
5 | OpenAI’s deep research can complete 26% of Humanity’s Last Exam | https://news.google.com/read/CBMiekFVX3lxTE5pQ3RoSTRkbk1xcDA0MTYyUFUxa1ZYNWpMcHJjbTJfSG5lSEhWd01Ndmcza0UtTUtLQzZpajREaERlZTdxaDZ5eGtnd0hJeWtndzlZcmFYbmxjVlRqZXFJaUd4Z1JJUlhBbkM3Q0VoSkVNeEllUE9sUUZB?hl=en-CA&gl=CA&ceid=CA%3Aen | https://encrypted-tbn3.gstatic.com/faviconV2?url=https://fortune.com&client=NEWS_360&size=96&type=FAVICON&fallback_opts=TYPE,SIZE,URL | Feb 11 | By Greg McKenna | https://news.google.com/api/attachments/CC8iK0NnNWFNV0pFU1VkSVdXUjRSM3AyVFJERUF4aW1CU2dLTWdhbFZvd01KZ2c=-w200-h112-p-df-rw |

Keyword |
---|
human benchmark |