You are the CFO of a $50M/year commercial HVAC contractor.
Last quarter, three projects closed at 6.8% realized margin against a 15.2% bid margin. This is no longer an anomaly β itβs a pattern.
Dashboards show data.
You need something that thinks.
Your mission is to build an AI agent β not a chatbot β that autonomously monitors a portfolio of HVAC construction projects, identifies margin erosion risks, investigates root causes, takes action, and reports back.
The agent must:
- Scan the entire portfolio
- Detect margin risk signals
- Investigate by chaining tool calls
- Produce actionable outputs
- Support follow-up questions with memory
- Communicate findings clearly in business language
This is not a dashboard challenge.
This is not a chatbot challenge.
You are building an autonomous AI agent using:
- An LLM (reasoning brain)
- Tool calling (data access + calculations)
- Memory
- A looping mechanism (stopWhen)
- Email capability (proactive reporting)
Your agent should continue investigating until it understands the situation β not stop after one query.
π DatasetParticipants receive a realistic construction portfolio dataset (~18K records), including:
- contracts.csv
- sov.csv
- sov_budget.csv
- labor_logs.csv
- material_deliveries.csv
- billing_history.csv
- billing_line_items.csv
- change_orders.csv
- rfis.csv
- field_notes.csv (~1,300 unstructured reports)
Embedded within the data are real-world signals:
- Labor overruns
- Scope drift
- Verbal approvals
- Billing lags
- Pending change orders
- RFI-related exposure
Not every project is failing.
Not every risk is obvious.
Your agent must find the story.
Given a prompt like:
βHowβs my portfolio doing?βYour agent should autonomously:
- Assess overall margin health
- Identify high-risk projects
- Investigate root causes (labor, materials, billing, change orders, RFIs, field notes)
- Quantify financial exposure
- Recommend specific recovery actions
- Send an email summary or alert
- Support follow-up conversation with context memory
Requirements
π¦ Submission Requirements
Each team must submit:
Working Agent- GitHub repository OR deployed URL
- Must be functional and accessible
- v0 project link or IDE prompt history
- Architecture overview
- Tool design
- Model choice & strategy
- How looping works
- Email implementation
- What you would improve with more time
- 3β5 minutes
- Show autonomous investigation
- Show email being triggered
- Demonstrate follow-up memory
- β A static dashboard
- β A basic Q&A chatbot
- β A data pipeline project
- β A black-box AI with no reasoning visibility
If it doesnβt reason, investigate, and act β itβs not an agent.
Prizes
Surprise!!!
Surprise to be announced!
Devpost Achievements
Submitting to this hackathon could earn you:
Judges
Caroline Ciaramitaro
Community at v0 by Vercel
Mitchell Itkin
Founder of Pulse AI NYC
Mark Bakshiyev
Co-founder of Pulse AI NYC
DJ Lee
Co-founder of Pulse AI NYC
Judging Criteria
-
Agent Intelligence (40 pts)
- Autonomous reasoning across portfolio - Multi-step tool chaining - Accurate financial calculations - Actionable outputs (not vague flags) -
Agent Experience (30 pts)
- Transparent step-by-step reasoning - Real-time streaming responses - Conversational follow-up with memory - Clear, business-friendly communication -
Implementation Quality (20 pts)
- Built using required stack - Proper use of Granola - Email reporting works - Handles ~18K records efficiently - Reasonable performance - Deployed and accessible -
Business Insight (10 pts)
- Explains why margin erosion occurs - Quantifies exposure - Forecasts potential outcomes -
Bonus (Up to +20 pts)
- Proactive alerting without user prompt - Cross-project pattern detection - Confidence levels / uncertainty acknowledgment - Deep multi-turn conversational memory
Questions? Email the hackathon manager
Tell your friends
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
