# | Article Title | News Link | Source Image | When Posted | Writer | Article Thumb |
|---|---|---|---|---|---|---|
1 | New 'renewable' benchmark streamlines LLM jailbreak safety tests with minimal human effort | https://news.google.com/read/CBMiiAFBVV95cUxPVEoxcFBmcUlQbHBLR3FMeDVzN3k0a3R1dlFXVDNHaVUtYnR2X0dpLVY5VEZKT2N3bXlMc242UUdFMnhYdWtIaWV0SzlkNTNDSHJBY0FzNExmeWx3UDdjWnptZ21BQ1d0NWNILXFweHVNZXVrcVctNmNMUkFHOGRHX0dvQUtwMjFE?hl=en-CA&gl=CA&ceid=CA%3Aen | https://encrypted-tbn1.gstatic.com/faviconV2?url=https://techxplore.com&client=NEWS_360&size=96&type=FAVICON&fallback_opts=TYPE,SIZE,URL | 19 days ago | By Jaimie Patterson | https://news.google.com/api/attachments/CC8iK0NnNXRMVkJUWVZkWFdXNHpMWGxpVFJDN0Foak1CeWdLTWdhQllZZ0pLZ1U=-w200-h112-p-df-rw |
2 | I benchmarked myself instead of my PC and discovered a lot | https://news.google.com/read/CBMiggFBVV95cUxOZzdDRFNjdG5vaERzWVlJVzVCaWlmb21kZ2dsMjdBOWp4VHNqZzFNdFBsRnFOUzdnU2FfekFRWWpkZ1VEQmZ0SkpBTTEtV2l6azN4dGdVLWZmRkZYLWphbGlFRlh6d3VILXhmRHpIWElJU2xUWDVqS29zd21nbl8zM0FB?hl=en-CA&gl=CA&ceid=CA%3Aen | https://encrypted-tbn2.gstatic.com/faviconV2?url=https://www.makeuseof.com&client=NEWS_360&size=96&type=FAVICON&fallback_opts=TYPE,SIZE,URL | Nov 28 | By Ben Stegner | https://news.google.com/api/attachments/CC8iK0NnNVJORVp6VkVrNFpuVkVRVm8wVFJDZkF4ampCU2dLTWdZNVJJcHhwUVU=-w200-h112-p-df-rw |
3 | Don’t Panic: ‘Humanity’s Last Exam’ has begun | https://news.google.com/read/CBMiigFBVV95cUxQeG15bVNobzhzb0U1LThCY0t1NzFTZXI1VXpkX0RCZXF6QzczZFNHOXh6U3liZVhrOWVVTE1iWUFoU3JBNWxTSk03T1pyRGh3Yk0zTDRXQTdWSzJCWFNFQkQ2WW1mVlJJNVNwdVhDSmtEMWFQLW8tMlZkWWl5NmlkbVN0YUs5c0VoX2c?hl=en-CA&gl=CA&ceid=CA%3Aen | https://encrypted-tbn3.gstatic.com/faviconV2?url=https://stories.tamu.edu&client=NEWS_360&size=96&type=FAVICON&fallback_opts=TYPE,SIZE,URL | Feb 25 | By Lesley Henton | https://news.google.com/api/attachments/CC8iK0NnNDJhakpNYkZWWVRua3RkblZxVFJEV0FoamdCQ2dLTWdhZFk0cHNLUWc=-w200-h112-p-df-rw |
4 | A new AI benchmark tests whether chatbots protect human well-being | https://news.google.com/read/CBMiogFBVV95cUxPWldyVF9UbFAtMlV1QUpFWnVoSVhmdWxldkVNX3RmSUExdUNRN05fbW5zM05Ed0ZJUWNreVBxMmc4MFQ5dm1URzVoOWIzZWxKTDFLWWdmOXhXNkx3OXRtNFVPV1N0VTRwUndySzdYT2hWNXFlRXRvVmZjRnljZFVlNHdiOW1jN1UzXzZQQUxCa2wzS20wZkpLUGJoSWloa1Rxc0E?hl=en-CA&gl=CA&ceid=CA%3Aen | https://encrypted-tbn1.gstatic.com/faviconV2?url=https://techcrunch.com&client=NEWS_360&size=96&type=FAVICON&fallback_opts=TYPE,SIZE,URL | Nov 24 | By Rebecca Bellan | https://news.google.com/api/attachments/CC8iK0NnNVJaamQ1WkhBMGVVeDFjME5WVFJERUF4aW1CU2dLTWdZTkZZQ3V4UUk=-w200-h112-p-df-rw |
5 | Sony has a new benchmark for ethical AI | https://news.google.com/read/CBMiigFBVV95cUxQaTlkb1dRdnUta3hGdjRnRVhUMDFrcE5CVnFxY0V6MjkyUTFSM2U4c0xvQzJLU25zdG9RazUyVVd0c3F6c2Q4d0tQTDl0Z3VpUkhSdUM2b2RJNmxrem1qY2YzNmZlOGFjeTAxZ3MwQUw1bTBxSm9vVUVrUEZjTE44Qk1MZ0txRmV4WGc?hl=en-CA&gl=CA&ceid=CA%3Aen | https://encrypted-tbn1.gstatic.com/faviconV2?url=https://www.engadget.com&client=NEWS_360&size=96&type=FAVICON&fallback_opts=TYPE,SIZE,URL | Nov 5 | By Will Shanklin | https://news.google.com/api/attachments/CC8iJ0NnNWlkbG80VW5sUWJXNVhNblJQVFJDZkF4ampCU2dLTWdPTmRndw=-w200-h112-p-df-rw |
Keyword |
|---|
human benchmark |