Posts

New ask Hacker News story: Tell HN: My daily game won a Players Choice Award

Tell HN: My daily game won a Players Choice Award 7 by paulhebert | 2 comments on Hacker News. I've shared my game Tiled Words a few times in "What are you working on" threads and as a Show HN. I wanted to share with y'all that today it won the Players' Choice Award at the 2025 Playlin Daily Game Awards! It was also runner up for Best Word Game and a finalist for Best Classic Game Reimagined and Best Visual Design. Thanks to everyone herewho commented or played. Your feedback and encouragement has made Tiled Words the game it is today. I designed and developed the game and make the puzzles with my wife. We would have stopped long ago if not for the positive feedback from the community. Playlin is a really cool organization and all of the winners are fantastic games that you should try: https://ift.tt/Fa0eLg8 And if you haven't played Tiled Words yet, give it a try here: https:// tiledwords.com

New ask Hacker News story: Seeking Advice on Improving OCR for Watermarked PDFs in My RAG Pipeline

Seeking Advice on Improving OCR for Watermarked PDFs in My RAG Pipeline 2 by hundredtrillion | 0 comments on Hacker News. I’ve been developing a small RAG pipeline and ran into a specific technical issue involving OCR. I’m using PyMuPDF for extraction, and whenever a PDF contains a centered watermark on each page, the OCR becomes noisy—text breaks, artifacts show up, and the output degrades enough that it affects chunking and retrieval accuracy downstream. The document is otherwise clean, so I’m trying to understand whether this is a known limitation of PyMuPDF or if there are better approaches for handling watermarked PDFs before OCR. I’m working with an RTX 4000 (8GB VRAM), so I’m also trying to stay within reasonable GPU constraints. I’d really appreciate any ideas on: more robust OCR libraries or models that handle watermarks well preprocessing strategies to suppress watermark text better extraction pipelines for RAG use cases or any general advice on improving this part of the ...

New ask Hacker News story: Ask HN: Anthropic has stood its ground. What excuse is left for other companies?

Ask HN: Anthropic has stood its ground. What excuse is left for other companies? 4 by chirau | 1 comments on Hacker News.

New ask Hacker News story: Tell HN: MitID, Denmark's digital ID, was down

Tell HN: MitID, Denmark's digital ID, was down 95 by mousepad12 | 143 comments on Hacker News. MitID is the sole digital ID provider, leading the entire country unable to log into their internet banking, public services, digital mail etc. https://ift.tt/hlxvwD6...

New ask Hacker News story: Ask HN: How do you handle duplicate side effects when jobs, workflows retry?

Ask HN: How do you handle duplicate side effects when jobs, workflows retry? 2 by shineDaPoker | 0 comments on Hacker News. Quick context: I'm building background job automation and keep hitting this pattern: 1. Job calls external API (Stripe, SendGrid, AWS) 2. API call succeeds 3. Job crashes before recording success 4. Job retries → calls API again → duplicate Example: process refund, send email notification, crash. Retry does both again. Customer gets duplicate refund email (or worse, duplicate refund). I see a few approaches: Option A: Store processed IDs in database Problem: Race between "check DB" and "call API" can still duplicate Option B: Use API idempotency keys (Stripe supports this) Problem: Not all APIs support it (legacy systems, third-party) Option C: Build deduplication layer that checks external system first Problem: Extra latency, extra complexity What do you do in production? Accept some duplicates? Only use APIs with idempotency? Something...

New ask Hacker News story: Stripe closed my account – no notice – my LLC was registered using Stripe Atlas

Stripe closed my account – no notice – my LLC was registered using Stripe Atlas 2 by travelyesim | 0 comments on Hacker News. I resell travel eSIMs data plan under my brand. I have 3DS enabled and strict Stripe Radar rules and backend logic to limit fraud. Last dispute I received was 7 months ago. Still, suddenly today my account closed. I did appeal and reach out to support through multiple channels via email and X. All payments by my current customers are being rejected. You can imagine how hard it was to attract customers in the current hot eSIM market. My LLC was registered using Stripe Atlas so they already have all my info in addition to the document I sent today during the appeal. Waiting 2-3 days to get a response whether they will reactivate or not is very long. That's supper unprofessional. Can you help push this post bcz they only seem to take serious action after a social media post..!!

New ask Hacker News story: I built a 151k-node GraphRAG swarm that autonomously invents SDG solutions

I built a 151k-node GraphRAG swarm that autonomously invents SDG solutions 2 by wisdomagi | 0 comments on Hacker News. Hi HN, I wanted to share a passion project I've been building: PROMETHEUS AGI. I got frustrated that most LLM/RAG applications just summarize text. I wanted to see if an agentic swarm could actually perform cross-domain reasoning to invent new physical solutions (focusing on UN SDGs). The Stack: Neo4j Aura (Free tier maxed out at 151k nodes / 400k edges) Ingestion: Google BigQuery (Patents) + OpenAlex API LLMs: Ollama (Llama 3) for zero-cost local entity extraction, Claude 3.5 via MCP for deep reasoning. UI: Streamlit (Digital Twin dashboard) + React/Vite (Landing). How it works: The swarm maps problems (e.g., biofouling in water filters) to isolated technologies across different domains (e.g., materials science + nanobiology) and looks for "Missing Links"—combinations that don't exist in the patent database yet. So far, the pipeline has autonomous...