The Complete Guide to Autobound's Signal Database
Job changes, SEC filings, hiring velocity, G2 reviews, website intelligence, LinkedIn posts — how Autobound's Signal Database delivers structured buying triggers via GCS bucket delivery.
Daniel Wiener
Oracle and USC Alum, Building the ChatGPT for Sales.

Why Signals Beat Static Data
Traditional prospecting relies on firmographics — company size, industry, tech stack — that tell you who might buy but nothing about when. Signals flip this model. When a CFO posts about digital transformation challenges, when a company's 10-K reveals an AI investment initiative, when Reddit threads surface competitive displacement conversations — these moments create windows where outreach actually converts.
The Autobound Signal Database captures these windows systematically. Rather than building dozens of scrapers and maintaining complex ETL pipelines, teams get a unified feed of schema-validated, entity-resolved signals delivered directly to GCS buckets or via API. Each signal comes with confidence scores, LLM-generated summaries, and the raw evidence needed for downstream processing.
Three ways teams use signals today:
- Sales triggers: Route high-intent signals (job changes, funding, hiring velocity spikes) to reps in real-time
- Account scoring: Feed signal density and recency into predictive models for prioritization
- Content personalization: Use extracted pain points, initiatives, and technologies to generate relevant messaging at scale
Contact-Level Signals
Contact signals capture what individual people are doing — job moves, public posts, communication style, shared connections. These are the signals that let you know who specifically is worth reaching out to and why now.
Job Changes
When someone changes jobs, there's a window where they're evaluating new tools, building new processes, and open to conversations they'd ignore six months later. The job change signal captures these transitions within a 90-day window:
{
"signal_type": "job-change",
"signal_subtype": "workExperienceJobChange",
"contact": {
"email": "[email protected]",
"linkedin_url": "linkedin.com/in/████████"
},
"company": {
"name": "Modern Enterprise",
"domain": "modernenterprise.com",
"linkedin_url": "linkedin.com/company/████████"
},
"data": {
"summary": "Founded Modern Enterprise as Founder & Principal Advisor 1 month ago.",
"new_job_title": "Founder & Principal Advisor",
"new_job_company_name": "Modern Enterprise",
"new_job_company_domain": "modernenterprise.com",
"new_job_company_linkedin_url": "linkedin.com/company/████████",
"new_job_description": "Building advisory practice focused on enterprise go-to-market strategy...",
"previous_company_name": "AMD",
"previous_company_domain": "amd.com",
"previous_company_linkedin_url": "linkedin.com/company/████████",
"previous_role_title": "Director of Marketing",
"previous_role_description": "Led product marketing for Network Technology Solutions Group...",
"previous_role_duration_months": 9,
"founded_new_company": true
}
}You get the full career context — where they came from, where they landed, what they were doing before, and what they're doing now. The founded_new_company flag identifies entrepreneurial transitions separately from lateral moves or promotions.
Coverage spans 10-25% of monitored contacts with weekly refresh. Full job change schema →
LinkedIn Posts
People tell you what they care about through their posts. The challenge is extracting structure from free text at scale. The LinkedIn post signal does this automatically — parsing pain points, initiatives, technologies, and competitors from each post:
{
"signal_type": "linkedin-post-contact",
"signal_subtype": "linkedinPost",
"contact": {
"name": "Aditya Shankar",
"job_title": "Senior Director of Marketing",
"linkedin_url": "linkedin.com/in/████████"
},
"company": {
"name": "hosted.ai",
"domain": "hosted.ai",
"linkedin_url": "linkedin.com/company/████████"
},
"data": {
"post_url": "https://www.linkedin.com/feed/update/urn:li:activity:████████/",
"post_text": "The inaugural AI Leadership Forum exceeded expectations. We brought together 80 leaders across tech, infrastructure, and VC. The 'Agentic Workforce' is here.",
"posted_date": "2026-01-11T06: 29: 06.850Z",
"num_likes": 116,
"pain_points": [
{ "topic": "scaling AI agent governance", "intensity": 0.7 }
],
"initiatives": [
{ "topic": "hosting AI Leadership Forum", "urgency": 0.9 }
],
"technologies_mentioned": [
{ "name": "DeepSeek", "status": "considering" }
],
"competitors_mentioned": [
{ "name": "packet.ai" }
]
}
}The intensity and urgency scores (0-1) let you prioritize — a pain point at 0.9 intensity is more pressing than one at 0.3. The status field on technologies tells you where they are in the adoption cycle: evaluating, using, migrating_from, considering, etc.
Refresh is bi-weekly with 25-50% coverage. Full LinkedIn posts schema →
Behavioral Profiles
How someone prefers to communicate matters as much as what you say to them. The behavioral profile signal infers DISC personality dimensions from a contact's digital footprint, providing guidance on tone, structure, and persuasion approach. Coverage reaches 50-75% of contacts with weekly refresh. Full behavioral profile schema →
Shared Experiences
Common ground creates instant rapport. The shared experience signals detect previous employers where both parties worked, alma mater connections, and overlapping professional networks — structured for easy matching. Full shared experiences schema →
Company-Level Signals
Company signals monitor public filings, digital footprints, hiring behavior, and market sentiment to identify organizational buying readiness.
SEC Filings
Autobound processes every 10-K, 10-Q, 8-K, 20-F, and 6-K filing using LLMs trained on SEC document structure. Rather than parsing 200-page annual reports yourself, you get structured signals classified into 70+ subtypes — aiInvestment, digitalTransformation, costReduction, internationalExpansion, ceoChange, and so on.
{
"signal_type": "10k",
"signal_subtype": "capexIncrease",
"company": {
"name": "Applied Materials, Inc.",
"domain": "www.appliedmaterials.com",
"ticker": "AMAT"
},
"data": {
"summary": "AMAT nearly doubles CapEx to $2.26B in FY2025, driven by US infrastructure investment.",
"detail": "Capital expenditures surged by $1.07B (+90%) year-over-year.",
"confidence": "high",
"excerpts": "Capital expenditures $2,260 million for fiscal 2025 versus $1,190 million in fiscal 2024.",
"source_url": "https://www.sec.gov/Archives/edgar/data/6951/000162828025056742/amat-20251026.htm",
"filing_date": "2025-12-18",
"metrics": {
"dollar_millions": 2260,
"pct": 0.9,
"pct_context": "Year-over-year increase in capital expenditures"
}
}
}The metrics object provides structured numerical data when available — dollar amounts, percentages, timeframes — so you can filter signals by magnitude without parsing the summary text. All SEC signals refresh weekly (upgraded from monthly in January 2026), with cross-signal deduplication ensuring the same executive change mentioned in a 10-Q and earnings call generates only one signal. Full 10-K schema →
Glassdoor Reviews
The glassdoor signal aggregates employee sentiment across rating categories, curated feedback excerpts, and competitor mentions. You get the overall rating, breakdown by dimension (culture, compensation, work-life balance, management, career opportunities), and actual review snippets organized by sentiment.
{
"signal_type": "glassdoor",
"signal_subtype": "glassdoorCompanyReviewsAndRatings",
"company": {
"name": "ZoomInfo",
"domain": "zoominfo.com"
},
"data": {
"reviewCount": 1970,
"rating": 3.6,
"ratingBreakdown": {
"cultureAndValues": 3.5,
"workLifeBalance": 3.2,
"seniorManagement": 3.4,
"careerOpportunities": 3.7
},
"employeeFeedback": {
"pros": ["Tons of challenges, exciting environment, AI focus, strong product roadmap."],
"cons": ["Lack of training, toxic culture, erratic management."]
},
"competitors": ["Dun & Bradstreet", "Salesforce"],
"reviewLinks": {
"reviews": "https://www.glassdoor.com/Reviews/ZoomInfo-Reviews-E22253.htm"
}
}
}The competitors array surfaces companies mentioned by employees — often revealing competitive dynamics not visible elsewhere. Subtypes like glassdoorConsistentLeadershipComplaints and glassdoorTalentRetentionConcerns enable filtering for specific internal challenges. Full Glassdoor schema →
Employee Growth & Departmental Trends
The employee breakdown signal provides headcount distribution and growth rates by department, enabling detection of organizational priorities without manual research.
{
"signal_type": "employee-breakdown",
"signal_subtype": "employeeBreakdownAndGrowth",
"company": {
"name": "Amazon",
"domain": "amazon.com"
},
"data": {
"growth": {
"sixMonth": 3,
"oneYear": 7,
"twoYear": 9
},
"organization": {
"headCount": 579336,
"yearFounded": 1995
},
"headcountByDepartment": {
"engineering": 144558,
"sales": 14451,
"marketing": 5446,
"productManagement": 1740
},
"1yearGrowthByDepartment": {
"sales": 24,
"engineering": 4,
"productManagement": -3
},
"analysis": {
"summary": "Massive company showing slow but steady growth.",
"insight": "With a very large headcount (~579k), this suggests a mature organization."
}
}
}The 1yearGrowthByDepartment object lets you filter for companies investing in specific functions — engineering headcount up 40% signals product investment, sales hiring spikes indicate GTM expansion. Full employee breakdown schema →
News Events
The news signal captures leadership changes, funding announcements, partnerships, and other public events with structured metadata.
{
"signal_type": "news",
"signal_subtype": "hires",
"company": {
"name": "Egnyte",
"domain": "egnyte.com"
},
"data": {
"insightTitle": "Egnyte Appoints Prasad Gune as Chief Product Officer",
"insightContact": "Prasad Gune",
"insightJobTitle": "Chief Product Officer",
"insightDate": "September 4, 2024",
"insightUrl": "https://www.einnews.com/pr_news/740455906/egnyte-appoints-prasad-gune-as-chief-product-officer",
"insightArticleSource": "einnews.com",
"insightCategories": ["hires"]
}
}The insightCategories array provides machine-readable event classification. News signals refresh weekly, with daily delivery under evaluation. Full news schema →
Product Hunt Launches
Track when companies launch new products on Product Hunt — useful for identifying GTM activity and product development priorities.
{
"signal_type": "producthunt",
"signal_subtype": "companyProductHuntLaunch",
"company": {
"name": "In 1 Inbox",
"domain": "in1inbox.com"
},
"data": {
"productName": "In 1 Inbox",
"productTagline": "One place to manage, reply and sell across all platforms",
"productDescription": "Manage all your social DMs, comments, and sales chats in one AI-powered inbox.",
"productHuntUrl": "https://www.producthunt.com/posts/in-1-inbox",
"votes": 4,
"comments": 1,
"createdAt": "2025-06-23T07: 01: 00Z",
"topics": ["Sales", "Social Media", "Artificial Intelligence"]
}
}The topics array enables filtering by product category. Full Product Hunt schema →
Hiring Velocity
Two complementary signals track hiring behavior across 21+ million domains. hiring-velocity measures pace with accelerating, steady, or decelerating trend indicators, comparing current openings to 60 days prior. hiring-trends provides department-level snapshots. Both refresh weekly. Full hiring velocity schema →
Reddit Mentions
The reddit-company signal aggregates discussions from B2B-relevant subreddits (r/sysadmin, r/devops, r/saas, r/msp) with structured subtypes: churnRisk, buyingIntent, competitorMention, pricingConcern. Each signal includes a moderation score for brand safety (recommended filtering: confidence_score >= 0.8). Sample distribution from recent batches: 925 buying intent signals, 321 churn risk signals, 192 competitive intelligence signals. Full Reddit schema →
Technographics
The tech-stack signal covers 218M+ companies with three core subtypes: techUsedProspectUsesCompetitor (companies currently on a competing product), techUsedProspectRecentlyAdoptedCompetitor (switched in the last 90 days), and techUsedProspectUsesComplementaryTech (good fit based on adjacent tools). Full tech stack schema →
Website Intelligence
The website-intelligence signal monitors ~2 million company websites for approximately 3,600 distinct change types: pricing page updates, new product launches, partnership announcements, customer logo additions, security certifications, messaging changes. Monthly refresh. Full website intelligence schema →
Universal Schema
Every signal follows a normalized structure regardless of source. The outer envelope is consistent — signal_id, signal_type, signal_subtype, detected_at, association (company or contact), plus entity resolution fields. The data object varies by signal type but always includes an LLM-generated summary.
Entity resolution coverage:
company.domain: 99%+ coverage (primary join key)company.linkedin_url: 95%+ coveragecontact.email: 85-95% coveragecontact.linkedin_url: 90-98% coverage
The association field indicates whether a signal is company-level or contact-level, enabling clean routing logic. Full schema documentation →
GCS Delivery Infrastructure
Bucket Structure
Each signal type has a dedicated bucket with timestamped folders:
gs://autobound-10k/
├── 2026-02-03T12-00-00Z/
│ ├── output.jsonl ← Streaming, human-readable
│ └── output.parquet ← Analytics-optimizedBoth formats contain identical data. JSONL suits streaming ingestion and debugging; Parquet integrates directly with BigQuery, Snowflake, and Spark.
Manifest Files
A /manifest/ folder in each bucket provides per-drop JSON manifests with run timing, record counts, and completion status — enabling downstream processing triggers without polling.
Refresh Cadences
Weekly: SEC filings (all types), news, hiring velocity/trends
Bi-weekly: LinkedIn posts (contact)
Monthly: Reddit, G2, Glassdoor, GitHub, website intelligence
2026 Roadmap
New sources: Capterra and TrustRadius product reviews, Quora Q&A signals.
Infrastructure: Direct API access for signal fetch with age parameters, company connections graph (vendors, competitors, customers, investors), competitor URLs/domains in mentions, hiring velocity percentage change scores, daily news delivery.
Coverage: +25% audience expansion near-term, ~2× monitoring pool within 1-2 months, geographic expansion beyond North America.
Integration in 30 Minutes
Step 1: Receive service account JSON credentials from Autobound
Step 2: Authenticate
gcloud auth activate-service-account --key-file=~/credentials/autobound-key.json
gcloud storage ls --project=autobound-signal-deliveryStep 3: Pull latest data
LATEST=$(gcloud storage ls gs://autobound-10k/ | tail -1)
gcloud storage cp "${LATEST}output.parquet" ./signals/Step 4: Match to your CRM
SELECT s.*, c.*
FROM signals.latest s
LEFT JOIN crm.companies c
ON LOWER(s.company.domain) = LOWER(c.website_domain)Get Started
The Autobound Signal Database removes the infrastructure burden from signal-based data products. Instead of building scrapers, maintaining ETL pipelines, and normalizing disparate data sources, teams get unified, schema-validated intelligence delivered on schedule.
For data teams: Consistent schemas, enterprise authentication, Parquet for analytics, JSONL for streaming — integrate once, scale with your product.
Explore the full API documentation or contact our team to discuss your use case. Current customers can reach support@autobound.ai for technical questions.
Explore More
- Signal Data Products — Browse the full signal catalog with coverage stats and delivery options
- For Platforms — Learn how GTM platforms license Autobound's signal intelligence layer
- For Data Teams — Build vs. buy signal infrastructure analysis for data engineering leaders
- vs. Data Providers — See how Autobound compares to ZoomInfo, Clay, Apollo, and 6sense
- Contact Sales — Discuss your data licensing use case with our team
Ready to integrate signal intelligence?
Get started with 18+ proprietary signal types delivered via GCS, API, or flat file.
Talk to Our Data Team
Related Articles

Why Platform Teams License Signal Data Instead of Building It
Building scrapers and ETL pipelines for signal data costs $250K+/year. Learn why platform teams license pre-built signal intelligence instead.

AI SDR vs. Human SDR: What the Data Says
AI SDR vs human SDR in 2026: real cost-per-lead data, conversion rates, ROI timelines, and the 50-70% churn problem. Data-driven analysis for sales leaders.

The Outbound Sales Playbook for 2026
The 2026 outbound sales playbook: signal-based targeting, AI personalization, multi-channel cadences. Real data, templates, and frameworks that book meetings.
Ready to Transform Your Outreach?
See how Autobound uses AI and real-time signals to generate hyper-personalized emails at scale.