# Brainstorming Session: YouTube Subtitle Extraction & AI Summarization Bookmarklet

**Session Date:** September 5, 2025
**Topic:** YouTube subtitle extraction and AI summarization bookmarklet for Android Chrome
**Participant:** User
**Facilitator:** Mary (Business Analyst)

## Executive Summary

**Session Goals:**
- Broad exploration initially to discover creative possibilities
- Focus on technical feasibility and user experience
- Target audience: Mobile users watching educational YouTube content
- Technical constraints: Android Chrome, JavaScript bookmarklet, no special permissions

## Phase 1: Broad Exploration - What If Scenarios

### Core Vision Clarified
- **Primary Function:** AI summary of YouTube auto-generated subtitles using Claude.ai subscription
- **User Journey:** Visit youtube.com video → activate bookmarklet → get AI summary
- **Output Format:** New page with structured summary

### User Requirements Identified

**Summary Structure Desired:**
1. **Overview** - Video explanation for beginners
2. **Essential Points** - Key takeaways
3. **Value Proposition** - Why watch this video
4. **Beginner Summary** - Accessible explanation of the topic
5. **Sharing Capability** - Share the generated summary

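The desired summary structure could be turned into a reusable prompt along the following lines. This is a minimal sketch: the section names come from the list above, while `SUMMARY_SECTIONS`, `buildSummaryPrompt`, and the prompt wording are hypothetical and untested against Claude.ai. Item 5 (sharing) is a feature rather than a summary section, so it is left out of the prompt.

```javascript
// Hypothetical prompt template; the wording is an assumption, not a tuned prompt.
const SUMMARY_SECTIONS = [
  ["Overview", "Explain the video for beginners"],
  ["Essential Points", "List the key takeaways"],
  ["Value Proposition", "Say why the video is worth watching"],
  ["Beginner Summary", "Give an accessible explanation of the topic"],
];

function buildSummaryPrompt(transcript) {
  // Number the sections so the model returns them in a predictable order.
  const outline = SUMMARY_SECTIONS
    .map(([title, hint], i) => `${i + 1}. ${title}: ${hint}`)
    .join("\n");
  return [
    "Summarize the following YouTube transcript using these sections:",
    outline,
    "",
    "Transcript:",
    transcript.trim(),
  ].join("\n");
}
```

Keeping the sections in one array would make it easy to add alternative templates later without touching the function.
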
## Phase 2: User Experience Journey Mapping

### User Journey Flow
1. **Activation Moment:** Any time during video viewing (typically after pausing)
2. **Processing Time:** User is willing to wait while Claude.ai generates the summary
3. **Page Structure:** New page opens with summary, original video remains accessible
4. **Sharing Options:**
   - WhatsApp: WhatsApp-formatted text
   - Email: HTML format
   - Clipboard: HTML format

### Detailed User Flow
- User watches educational YouTube video on Android Chrome
- User pauses video when ready for summary
- User activates bookmarklet
- New page opens and Claude.ai starts processing
- User receives structured summary in new page
- Video remains accessible in original tab
- Multiple sharing format options are available

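For the WhatsApp sharing option, one possible approach is to wrap section headings in asterisks (which WhatsApp renders as bold) and hand the text to a `wa.me` share link. The `{ title, body }` section shape and both function names below are assumptions made for this sketch.

```javascript
// Sketch: format summary sections for WhatsApp and build a share link.
// The { title, body } shape is a hypothetical data model for this example.
function toWhatsAppText(sections) {
  // WhatsApp renders *text* as bold, so each heading is wrapped in asterisks.
  return sections
    .map(({ title, body }) => `*${title}*\n${body}`)
    .join("\n\n");
}

function whatsAppShareUrl(text) {
  // wa.me accepts a URL-encoded message in the `text` query parameter.
  return `https://wa.me/?text=${encodeURIComponent(text)}`;
}
```

Opening the resulting URL on Android should offer to start a chat with the text pre-filled; the email and clipboard options would need a separate HTML formatter.
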
## Phase 3: Technical Constraint Exploration - Creative Solutions

### Technical Solutions Identified

**1. Subtitle Extraction Strategy:**
- Inspiration from https://www.downloadyoutubesubtitles.com
- Bookmarklet functionality to download auto-generated YouTube subtitles
- Extract subtitle data directly without requiring special permissions

**2. Claude.ai Integration (RPA Approach):**
- JavaScript automation to simulate human interaction
- Automatically input extracted subtitle text into Claude.ai textbox
- Programmatically trigger chat button click
- Mimics human workflow for seamless integration

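The fill-and-click idea might look roughly like the sketch below. Both selectors are placeholders, not Claude.ai's actual DOM, and `fillAndSubmit` is a hypothetical helper. Note also that the code would have to run inside the Claude.ai page itself (for example from a second bookmarklet activated in that tab), because the browser's same-origin policy keeps a script running on youtube.com from reaching into another site's page.

```javascript
// Sketch only: both selectors are placeholders, not Claude.ai's real DOM.
const HYPOTHETICAL_SELECTORS = {
  input: "textarea",
  send: "button[type=submit]",
};

function fillAndSubmit(doc, text, selectors = HYPOTHETICAL_SELECTORS) {
  const input = doc.querySelector(selectors.input);
  const send = doc.querySelector(selectors.send);
  // Returning false lets the caller switch to the manual-paste fallback.
  if (!input || !send) return false;

  input.value = text;
  // UI frameworks usually react to input events rather than polling .value.
  input.dispatchEvent(new Event("input", { bubbles: true }));
  send.click();
  return true;
}
```

In the browser this would be called as `fillAndSubmit(document, transcript)`. Chat interfaces often use `contenteditable` elements or framework-controlled inputs, so setting `.value` may not be enough in practice; that uncertainty is one reason a manual-paste fallback is worth keeping.
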
**3. Mobile Formatting Requirements:**
- Readable, spaced layout for mobile consumption
- Easy-to-follow structure
- Clean, uncluttered presentation
- Optimized for Android Chrome viewing

**4. Sharing Implementation:**
- Clipboard integration for manual paste functionality
- User can paste content into any desired platform
- Maintains formatting flexibility across different sharing contexts

## Phase 4: Convergence & Prioritization - Solution Architecture

### Detailed Technical Discovery

**Subtitle Extraction API Identified:**
- Base URL: `https://www.downloadyoutubesubtitles.com/?u=[YouTube_URL]`
- Direct download endpoints discovered:
  - Auto-generated English: `https://www.downloadyoutubesubtitles.com/get2.php?i=VIDEO_ID&format=txt&hl=a.en&a=`
  - Manual English: `https://www.downloadyoutubesubtitles.com/get2.php?i=VIDEO_ID&format=txt&hl=en&a=`
- Challenge: CAPTCHA verification for human users
- Solution: User intervention for CAPTCHA when needed

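The two download endpoints differ only in the `hl` parameter, so a small helper could build either one. This sketch takes the URL pattern directly from the endpoints listed above; `buildSubtitleUrl` and its `autoGenerated` option are hypothetical names, and the site's behaviour (including when it shows a CAPTCHA) is not verified here.

```javascript
// URL pattern copied from the endpoints noted in the session.
const SUBTITLE_ENDPOINT = "https://www.downloadyoutubesubtitles.com/get2.php";

function buildSubtitleUrl(videoId, { autoGenerated = true } = {}) {
  // "a.en" selects auto-generated English, "en" selects manual English.
  const lang = autoGenerated ? "a.en" : "en";
  return `${SUBTITLE_ENDPOINT}?i=${encodeURIComponent(videoId)}&format=txt&hl=${lang}&a=`;
}
```

Because a CAPTCHA may interrupt the download, opening this URL in a tab the user can interact with is probably safer than fetching it silently.
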
### Prioritized Solution Architecture

**Selected Component Combination:**
- **A1**: Leverage downloadyoutubesubtitles.com API with CAPTCHA handling
- **B1 (Primary)**: RPA automation (fill textbox + click) in Claude.ai
- **B2 (Fallback)**: Manual paste if RPA fails
- **C2**: Output displayed in Claude.ai tab with results

### Implementation Strategy
1. Extract YouTube video ID from current page
2. Call subtitle API with user CAPTCHA intervention if needed
3. Open Claude.ai in new tab
4. Attempt RPA automation to input text and generate summary
5. If RPA fails, provide clipboard copy for manual paste
6. Display structured summary in Claude.ai interface

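Step 1 of the strategy, extracting the video ID, could be sketched as a pure function over the page URL. `extractVideoId` is a hypothetical helper; it covers the common `watch`, `youtu.be`, `shorts`, and `embed` URL shapes and returns `null` for anything else.

```javascript
// Sketch: pull the video ID out of common YouTube URL shapes.
function extractVideoId(pageUrl) {
  let url;
  try {
    url = new URL(pageUrl);
  } catch {
    return null; // not a parseable URL
  }
  // Treat www.youtube.com and m.youtube.com like youtube.com.
  const host = url.hostname.replace(/^(www\.|m\.)/, "");
  if (host === "youtu.be") {
    return url.pathname.slice(1) || null;
  }
  if (host === "youtube.com") {
    if (url.pathname === "/watch") return url.searchParams.get("v");
    const match = url.pathname.match(/^\/(shorts|embed)\/([^/]+)/);
    if (match) return match[2];
  }
  return null;
}
```

Inside the bookmarklet this would be called as `extractVideoId(location.href)`, with a `null` result prompting a message that the current page is not a video.
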
## Idea Categorization

### Immediate Opportunities (Ready to implement)
1. **Basic bookmarklet creation** - Extract video ID and create subtitle API calls
2. **CAPTCHA handling workflow** - Guide user through manual intervention when needed
3. **Claude.ai tab opening** - Simple window.open() to claude.ai with new chat
4. **Clipboard fallback** - Copy formatted text when RPA automation fails

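For the basic bookmarklet itself, the script has to be packed into a single `javascript:` URL. A minimal sketch of that packaging step, assuming a hypothetical build helper named `toBookmarklet`:

```javascript
// Sketch: wrap script source into a javascript: URL for use as a bookmark.
function toBookmarklet(source) {
  // The IIFE keeps the script's variables out of the page's global scope.
  const wrapped = `(function(){${source}})();`;
  // Encoding avoids breakage from spaces, quotes, and percent signs in the URL.
  return `javascript:${encodeURIComponent(wrapped)}`;
}
```

The returned string would be pasted into the URL field of a Chrome bookmark; a build script could apply it to a minified bundle.
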
### Future Innovations (Requires development/research)
1. **Advanced RPA automation** - Sophisticated Claude.ai interface manipulation
2. **Multi-language support** - Extend beyond English subtitles
3. **Batch processing** - Handle multiple videos in sequence
4. **Custom prompt templates** - Different summary styles for different content types

### Moonshots (Ambitious concepts)
1. **CAPTCHA bypass techniques** - Automated CAPTCHA solving
2. **Direct Claude.ai API integration** - Official API instead of RPA
3. **Browser extension evolution** - Full-featured extension vs bookmarklet
4. **Cross-platform compatibility** - iOS Safari, desktop browsers

## Action Planning

### Top 3 Priority Ideas
1. **MVP Bookmarklet (Priority 1)**
   - Rationale: Core functionality with manual fallbacks
   - Next steps: Code basic video ID extraction and API calls
   - Resources needed: JavaScript development, API testing
   - Timeline: 1-2 weeks for working prototype

2. **RPA Automation for Claude.ai (Priority 2)**
   - Rationale: Key differentiator for seamless experience
   - Next steps: Research Claude.ai DOM structure, test automation
   - Resources needed: Advanced JavaScript, DOM manipulation
   - Timeline: 2-3 weeks after MVP

3. **Enhanced User Experience (Priority 3)**
   - Rationale: Mobile optimization and sharing features
   - Next steps: Design mobile-friendly interface, implement sharing
   - Resources needed: UI/UX design, mobile testing
   - Timeline: 1-2 weeks after core functionality

## Reflection & Follow-up

### What Worked Well
- Clear technical constraints helped focus creative exploration
- Progressive technique flow revealed practical solutions
- User's specific requirements shaped realistic implementation path
- Discovery of existing API significantly simplified technical approach

### Areas for Further Exploration
- Alternative subtitle extraction methods as backup
- Error handling strategies for failed API calls
- Performance optimization for mobile devices
- User feedback integration for summary quality improvement

### Recommended Follow-up Techniques
- **Technical prototyping session** - Build and test core components
- **User journey testing** - Validate mobile experience with real users
- **Risk assessment brainstorming** - Identify and mitigate potential failures
- **Feature prioritization** - Refine roadmap based on user feedback

### Questions for Future Sessions
- How to handle videos without auto-generated subtitles?
- What fallback options for API downtime?
- How to optimize for different video lengths?
- What metrics to track for success measurement?

---

**Session Completed:** September 5, 2025
**Total Ideas Generated:** 25+ concepts across 4 phases
**Key Innovation:** RPA automation approach for Claude.ai integration
**Primary Outcome:** Clear technical architecture and implementation roadmap