惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

V
Vulnerabilities – Threatpost
U
Unit 42
F
Fortinet All Blogs
aimingoo的专栏
aimingoo的专栏
P
Proofpoint News Feed
F
Full Disclosure
月光博客
月光博客
Engineering at Meta
Engineering at Meta
博客园_首页
The Register - Security
The Register - Security
G
Google Developers Blog
The Cloudflare Blog
博客园 - Franky
K
Kaspersky official blog
A
Arctic Wolf
Scott Helme
Scott Helme
C
Cisco Blogs
Hugging Face - Blog
Hugging Face - Blog
C
Check Point Blog
NISL@THU
NISL@THU
AI
AI
D
DataBreaches.Net
Cyber Security Advisories - MS-ISAC
Cyber Security Advisories - MS-ISAC
Stack Overflow Blog
Stack Overflow Blog
Project Zero
Project Zero
The GitHub Blog
The GitHub Blog
H
Hackread – Cybersecurity News, Data Breaches, AI and More
量子位
Vercel News
Vercel News
T
Tor Project blog
P
Privacy International News Feed
D
Docker
I
Intezer
L
LangChain Blog
P
Proofpoint News Feed
Security Latest
Security Latest
C
CXSECURITY Database RSS Feed - CXSecurity.com
T
Threatpost
博客园 - 聂微东
AWS News Blog
AWS News Blog
Martin Fowler
Martin Fowler
P
Privacy & Cybersecurity Law Blog
V
V2EX
Last Week in AI
Last Week in AI
C
Cybersecurity and Infrastructure Security Agency CISA
The Hacker News
The Hacker News
T
Tenable Blog
Blog — PlanetScale
Blog — PlanetScale
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
T
Tailwind CSS Blog

The Guardian Engineering Blog

The Guardian Engineering Blog - Day in the Life: Stefano Le Pera The Guardian Engineering Blog - Berger Hack Day: more global, more digital and more visual The Guardian Engineering Blog - Faster, cheaper, messier: lessons from our switch to self-hosted GitHub Actions The Guardian Engineering Blog - Day in the Life: Simon Adcock The Guardian Engineering Blog - The end of password pain: building frictionless authentication at the Guardian The Guardian Engineering Blog - Cooking up recipe data for the Feast app The Guardian Engineering Blog - Day in the Life: Alex Guild The Guardian Engineering Blog - Parsing: the merit of strictly typed JSON The Guardian Engineering Blog - Hack Day: Summer of Sport Fall of Democracy The Guardian Engineering Blog - When security matters: working with Qubes OS at the Guardian The Guardian Engineering Blog - Pinboard: transforming communication across the newsroom (part 3 of 3) The Guardian Engineering Blog - Pinboard: transforming communication across the newsroom (part 2 of 3) The Guardian Engineering Blog - Pinboard: transforming communication across the newsroom (part 1 of 3) The Guardian Engineering Blog - The Digital Fellowship is your foot in the door to the future of news
The Guardian Engineering Blog - Large language models and generative AI: a recent hack day
Rasha Ardati · 2023-12-22 · via The Guardian Engineering Blog

The discussion of large language models (LLMs) and generative artificial intelligence was everywhere in 2023 – not least in the Guardian’s Product and Engineering department. Hack days are a staple part of the software development culture, so it was no surprise that at this year’s final hackathon, several developers and data scientists focused their attention in this area – covering potential applications in podcasting, search and image generation. And who wouldn’t want a browser extension that assesses the mood of a news article and finds an appropriate music track to enhance your reading experience?

In total, the teams, including colleagues from across the Guardian, produced and presented 24 hacks in the course of the two-day event – some of which took an entirely different turn. These included: a product that spits out cultural recommendations; a dedicated platform for showcasing Guardian documentaries; experiments with ChatGPT and Google Bard involving tools like Trello to improve efficiency; and a Guardian-themed generative AI screensaver.

Here are three acts that won an award:

crosswordsplus hack screenshot
crosswordsplus hack screenshot Photograph: Dana Dramowiczs/The Guardian

Best technical hack: CrosswordsPlus

Play crosswords with your friends in real time with the Guardian’s Puzzles and Crosswords app on iOS. Built using SharePlay, a native iOS framework, this hack enabled multiplayer functionality even if the user is not signed-in. Apple takes care of the data transfer and offers end-to-end encryption – you just simply start a session using FaceTime, iMessage or AirDrop.

Best entertaining hack: 5/15s in 515s

A hack which takes a weekly “5/15” report – through which teams update on their progress – and converts it into an audio podcast that is 515 seconds long. Perfect for any manager who might not otherwise have time to read everything! The hack works by using an OpenAI GPT-3.5 model to summarise each team’s report before converting this into audio using a ext to speech model.

5/15s hack presentation screenshot
5/15s hack presentation screenshot Illustration: Mahesh Makani/The Guardian

Most Artificially Intelligent hack: Linguini Labelling Method

Building labelled training datasets is crucial for successful machine learning projects, especially in Natural Language Processing (NLP) – but this process is often laborious and time-consuming. Quite often you need multiple annotators labelling thousands of examples to build a performant model, and ideally you want them to annotate the same examples and to reach a good level of agreement.

The idea for this hack was to leverage the ability of Large Language Models (LLMs) to recognise concepts in text and use them as additional annotators to speed up the process and make it more robust. The proposed pipeline integrates various LLMs alongside human annotators: when both entities agree on a data point, it seamlessly enters the training dataset; if discrepancies arise, a human review is triggered, focusing efforts solely on annotating “difficult” examples, maximising efficiency.

LLM hack presentation screenshot
LLM hack presentation screenshot Photograph: AnnaV Vissens/The Guardian