Technology SignalsFinancial SignalsThought LeadershipAI for Sales

The Complete Guide to Autobound's Signal Database

Job changes, SEC filings, hiring velocity, G2 reviews, website intelligence, LinkedIn posts — how Autobound's Signal Database delivers structured buying triggers via GCS bucket delivery.

Daniel Wiener

Daniel Wiener

Oracle and USC Alum, Building the ChatGPT for Sales.

··10 min read
The Complete Guide to Autobound's Signal Database

Why Signals Beat Static Data

Traditional prospecting relies on firmographics — company size, industry, tech stack — that tell you who might buy but nothing about when. Signals flip this model. When a CFO posts about digital transformation challenges, when a company's 10-K reveals an AI investment initiative, when Reddit threads surface competitive displacement conversations — these moments create windows where outreach actually converts.

The Autobound Signal Database captures these windows systematically. Rather than building dozens of scrapers and maintaining complex ETL pipelines, teams get a unified feed of schema-validated, entity-resolved signals delivered directly to GCS buckets or via API. Each signal comes with confidence scores, LLM-generated summaries, and the raw evidence needed for downstream processing.

Three ways teams use signals today:

  • Sales triggers: Route high-intent signals (job changes, funding, hiring velocity spikes) to reps in real-time
  • Account scoring: Feed signal density and recency into predictive models for prioritization
  • Content personalization: Use extracted pain points, initiatives, and technologies to generate relevant messaging at scale

Contact-Level Signals

Contact signals capture what individual people are doing — job moves, public posts, communication style, shared connections. These are the signals that let you know who specifically is worth reaching out to and why now.

Job Changes

When someone changes jobs, there's a window where they're evaluating new tools, building new processes, and open to conversations they'd ignore six months later. The job change signal captures these transitions within a 90-day window:

{
  "signal_type": "job-change",
  "signal_subtype": "workExperienceJobChange",
  "contact": {
    "email": "[email protected]",
    "linkedin_url": "linkedin.com/in/████████"
  },
  "company": {
    "name": "Modern Enterprise",
    "domain": "modernenterprise.com",
    "linkedin_url": "linkedin.com/company/████████"
  },
  "data": {
    "summary": "Founded Modern Enterprise as Founder & Principal Advisor 1 month ago.",
    "new_job_title": "Founder & Principal Advisor",
    "new_job_company_name": "Modern Enterprise",
    "new_job_company_domain": "modernenterprise.com",
    "new_job_company_linkedin_url": "linkedin.com/company/████████",
    "new_job_description": "Building advisory practice focused on enterprise go-to-market strategy...",
    "previous_company_name": "AMD",
    "previous_company_domain": "amd.com",
    "previous_company_linkedin_url": "linkedin.com/company/████████",
    "previous_role_title": "Director of Marketing",
    "previous_role_description": "Led product marketing for Network Technology Solutions Group...",
    "previous_role_duration_months": 9,
    "founded_new_company": true
  }
}

You get the full career context — where they came from, where they landed, what they were doing before, and what they're doing now. The founded_new_company flag identifies entrepreneurial transitions separately from lateral moves or promotions.

Coverage spans 10-25% of monitored contacts with weekly refresh. Full job change schema →

LinkedIn Posts

People tell you what they care about through their posts. The challenge is extracting structure from free text at scale. The LinkedIn post signal does this automatically — parsing pain points, initiatives, technologies, and competitors from each post:

{
  "signal_type": "linkedin-post-contact",
  "signal_subtype": "linkedinPost",
  "contact": {
    "name": "Aditya Shankar",
    "job_title": "Senior Director of Marketing",
    "linkedin_url": "linkedin.com/in/████████"
  },
  "company": {
    "name": "hosted.ai",
    "domain": "hosted.ai",
    "linkedin_url": "linkedin.com/company/████████"
  },
  "data": {
    "post_url": "https://www.linkedin.com/feed/update/urn:li:activity:████████/",
    "post_text": "The inaugural AI Leadership Forum exceeded expectations. We brought together 80 leaders across tech, infrastructure, and VC. The 'Agentic Workforce' is here.",
    "posted_date": "2026-01-11T06: 29: 06.850Z",
    "num_likes": 116,
    "pain_points": [
      { "topic": "scaling AI agent governance", "intensity": 0.7 }
    ],
    "initiatives": [
      { "topic": "hosting AI Leadership Forum", "urgency": 0.9 }
    ],
    "technologies_mentioned": [
      { "name": "DeepSeek", "status": "considering" }
    ],
    "competitors_mentioned": [
      { "name": "packet.ai" }
    ]
  }
}

The intensity and urgency scores (0-1) let you prioritize — a pain point at 0.9 intensity is more pressing than one at 0.3. The status field on technologies tells you where they are in the adoption cycle: evaluating, using, migrating_from, considering, etc.

Refresh is bi-weekly with 25-50% coverage. Full LinkedIn posts schema →

Behavioral Profiles

How someone prefers to communicate matters as much as what you say to them. The behavioral profile signal infers DISC personality dimensions from a contact's digital footprint, providing guidance on tone, structure, and persuasion approach. Coverage reaches 50-75% of contacts with weekly refresh. Full behavioral profile schema →

Shared Experiences

Common ground creates instant rapport. The shared experience signals detect previous employers where both parties worked, alma mater connections, and overlapping professional networks — structured for easy matching. Full shared experiences schema →


Company-Level Signals

Company signals monitor public filings, digital footprints, hiring behavior, and market sentiment to identify organizational buying readiness.

SEC Filings

Autobound processes every 10-K, 10-Q, 8-K, 20-F, and 6-K filing using LLMs trained on SEC document structure. Rather than parsing 200-page annual reports yourself, you get structured signals classified into 70+ subtypes — aiInvestment, digitalTransformation, costReduction, internationalExpansion, ceoChange, and so on.

{
  "signal_type": "10k",
  "signal_subtype": "capexIncrease",
  "company": {
    "name": "Applied Materials, Inc.",
    "domain": "www.appliedmaterials.com",
    "ticker": "AMAT"
  },
  "data": {
    "summary": "AMAT nearly doubles CapEx to $2.26B in FY2025, driven by US infrastructure investment.",
    "detail": "Capital expenditures surged by $1.07B (+90%) year-over-year.",
    "confidence": "high",
    "excerpts": "Capital expenditures $2,260 million for fiscal 2025 versus $1,190 million in fiscal 2024.",
    "source_url": "https://www.sec.gov/Archives/edgar/data/6951/000162828025056742/amat-20251026.htm",
    "filing_date": "2025-12-18",
    "metrics": {
      "dollar_millions": 2260,
      "pct": 0.9,
      "pct_context": "Year-over-year increase in capital expenditures"
    }
  }
}

The metrics object provides structured numerical data when available — dollar amounts, percentages, timeframes — so you can filter signals by magnitude without parsing the summary text. All SEC signals refresh weekly (upgraded from monthly in January 2026), with cross-signal deduplication ensuring the same executive change mentioned in a 10-Q and earnings call generates only one signal. Full 10-K schema →

Glassdoor Reviews

The glassdoor signal aggregates employee sentiment across rating categories, curated feedback excerpts, and competitor mentions. You get the overall rating, breakdown by dimension (culture, compensation, work-life balance, management, career opportunities), and actual review snippets organized by sentiment.

{
  "signal_type": "glassdoor",
  "signal_subtype": "glassdoorCompanyReviewsAndRatings",
  "company": {
    "name": "ZoomInfo",
    "domain": "zoominfo.com"
  },
  "data": {
    "reviewCount": 1970,
    "rating": 3.6,
    "ratingBreakdown": {
      "cultureAndValues": 3.5,
      "workLifeBalance": 3.2,
      "seniorManagement": 3.4,
      "careerOpportunities": 3.7
    },
    "employeeFeedback": {
      "pros": ["Tons of challenges, exciting environment, AI focus, strong product roadmap."],
      "cons": ["Lack of training, toxic culture, erratic management."]
    },
    "competitors": ["Dun & Bradstreet", "Salesforce"],
    "reviewLinks": {
      "reviews": "https://www.glassdoor.com/Reviews/ZoomInfo-Reviews-E22253.htm"
    }
  }
}

The competitors array surfaces companies mentioned by employees — often revealing competitive dynamics not visible elsewhere. Subtypes like glassdoorConsistentLeadershipComplaints and glassdoorTalentRetentionConcerns enable filtering for specific internal challenges. Full Glassdoor schema →

Employee Growth & Departmental Trends

The employee breakdown signal provides headcount distribution and growth rates by department, enabling detection of organizational priorities without manual research.

{
  "signal_type": "employee-breakdown",
  "signal_subtype": "employeeBreakdownAndGrowth",
  "company": {
    "name": "Amazon",
    "domain": "amazon.com"
  },
  "data": {
    "growth": {
      "sixMonth": 3,
      "oneYear": 7,
      "twoYear": 9
    },
    "organization": {
      "headCount": 579336,
      "yearFounded": 1995
    },
    "headcountByDepartment": {
      "engineering": 144558,
      "sales": 14451,
      "marketing": 5446,
      "productManagement": 1740
    },
    "1yearGrowthByDepartment": {
      "sales": 24,
      "engineering": 4,
      "productManagement": -3
    },
    "analysis": {
      "summary": "Massive company showing slow but steady growth.",
      "insight": "With a very large headcount (~579k), this suggests a mature organization."
    }
  }
}

The 1yearGrowthByDepartment object lets you filter for companies investing in specific functions — engineering headcount up 40% signals product investment, sales hiring spikes indicate GTM expansion. Full employee breakdown schema →

News Events

The news signal captures leadership changes, funding announcements, partnerships, and other public events with structured metadata.

{
  "signal_type": "news",
  "signal_subtype": "hires",
  "company": {
    "name": "Egnyte",
    "domain": "egnyte.com"
  },
  "data": {
    "insightTitle": "Egnyte Appoints Prasad Gune as Chief Product Officer",
    "insightContact": "Prasad Gune",
    "insightJobTitle": "Chief Product Officer",
    "insightDate": "September 4, 2024",
    "insightUrl": "https://www.einnews.com/pr_news/740455906/egnyte-appoints-prasad-gune-as-chief-product-officer",
    "insightArticleSource": "einnews.com",
    "insightCategories": ["hires"]
  }
}

The insightCategories array provides machine-readable event classification. News signals refresh weekly, with daily delivery under evaluation. Full news schema →

Product Hunt Launches

Track when companies launch new products on Product Hunt — useful for identifying GTM activity and product development priorities.

{
  "signal_type": "producthunt",
  "signal_subtype": "companyProductHuntLaunch",
  "company": {
    "name": "In 1 Inbox",
    "domain": "in1inbox.com"
  },
  "data": {
    "productName": "In 1 Inbox",
    "productTagline": "One place to manage, reply and sell across all platforms",
    "productDescription": "Manage all your social DMs, comments, and sales chats in one AI-powered inbox.",
    "productHuntUrl": "https://www.producthunt.com/posts/in-1-inbox",
    "votes": 4,
    "comments": 1,
    "createdAt": "2025-06-23T07: 01: 00Z",
    "topics": ["Sales", "Social Media", "Artificial Intelligence"]
  }
}

The topics array enables filtering by product category. Full Product Hunt schema →

Hiring Velocity

Two complementary signals track hiring behavior across 21+ million domains. hiring-velocity measures pace with accelerating, steady, or decelerating trend indicators, comparing current openings to 60 days prior. hiring-trends provides department-level snapshots. Both refresh weekly. Full hiring velocity schema →

Reddit Mentions

The reddit-company signal aggregates discussions from B2B-relevant subreddits (r/sysadmin, r/devops, r/saas, r/msp) with structured subtypes: churnRisk, buyingIntent, competitorMention, pricingConcern. Each signal includes a moderation score for brand safety (recommended filtering: confidence_score >= 0.8). Sample distribution from recent batches: 925 buying intent signals, 321 churn risk signals, 192 competitive intelligence signals. Full Reddit schema →

Technographics

The tech-stack signal covers 218M+ companies with three core subtypes: techUsedProspectUsesCompetitor (companies currently on a competing product), techUsedProspectRecentlyAdoptedCompetitor (switched in the last 90 days), and techUsedProspectUsesComplementaryTech (good fit based on adjacent tools). Full tech stack schema →

Website Intelligence

The website-intelligence signal monitors ~2 million company websites for approximately 3,600 distinct change types: pricing page updates, new product launches, partnership announcements, customer logo additions, security certifications, messaging changes. Monthly refresh. Full website intelligence schema →


Universal Schema

Every signal follows a normalized structure regardless of source. The outer envelope is consistent — signal_id, signal_type, signal_subtype, detected_at, association (company or contact), plus entity resolution fields. The data object varies by signal type but always includes an LLM-generated summary.

Entity resolution coverage:

  • company.domain: 99%+ coverage (primary join key)
  • company.linkedin_url: 95%+ coverage
  • contact.email: 85-95% coverage
  • contact.linkedin_url: 90-98% coverage

The association field indicates whether a signal is company-level or contact-level, enabling clean routing logic. Full schema documentation →


GCS Delivery Infrastructure

Bucket Structure

Each signal type has a dedicated bucket with timestamped folders:

gs://autobound-10k/
├── 2026-02-03T12-00-00Z/
│   ├── output.jsonl    ← Streaming, human-readable
│   └── output.parquet  ← Analytics-optimized

Both formats contain identical data. JSONL suits streaming ingestion and debugging; Parquet integrates directly with BigQuery, Snowflake, and Spark.

Manifest Files

A /manifest/ folder in each bucket provides per-drop JSON manifests with run timing, record counts, and completion status — enabling downstream processing triggers without polling.

Refresh Cadences

Weekly: SEC filings (all types), news, hiring velocity/trends

Bi-weekly: LinkedIn posts (contact)

Monthly: Reddit, G2, Glassdoor, GitHub, website intelligence

Full delivery documentation →


2026 Roadmap

New sources: Capterra and TrustRadius product reviews, Quora Q&A signals.

Infrastructure: Direct API access for signal fetch with age parameters, company connections graph (vendors, competitors, customers, investors), competitor URLs/domains in mentions, hiring velocity percentage change scores, daily news delivery.

Coverage: +25% audience expansion near-term, ~2× monitoring pool within 1-2 months, geographic expansion beyond North America.

Full roadmap →


Integration in 30 Minutes

Step 1: Receive service account JSON credentials from Autobound

Step 2: Authenticate

gcloud auth activate-service-account --key-file=~/credentials/autobound-key.json
gcloud storage ls --project=autobound-signal-delivery

Step 3: Pull latest data

LATEST=$(gcloud storage ls gs://autobound-10k/ | tail -1)
gcloud storage cp "${LATEST}output.parquet" ./signals/

Step 4: Match to your CRM

SELECT s.*, c.*
FROM signals.latest s
LEFT JOIN crm.companies c
  ON LOWER(s.company.domain) = LOWER(c.website_domain)

Get Started

The Autobound Signal Database removes the infrastructure burden from signal-based data products. Instead of building scrapers, maintaining ETL pipelines, and normalizing disparate data sources, teams get unified, schema-validated intelligence delivered on schedule.

For data teams: Consistent schemas, enterprise authentication, Parquet for analytics, JSONL for streaming — integrate once, scale with your product.


Explore the full API documentation or contact our team to discuss your use case. Current customers can reach support@autobound.ai for technical questions.

Explore More

  • Signal Data Products — Browse the full signal catalog with coverage stats and delivery options
  • For Platforms — Learn how GTM platforms license Autobound's signal intelligence layer
  • For Data Teams — Build vs. buy signal infrastructure analysis for data engineering leaders
  • vs. Data Providers — See how Autobound compares to ZoomInfo, Clay, Apollo, and 6sense
  • Contact Sales — Discuss your data licensing use case with our team

Ready to integrate signal intelligence?

Get started with 18+ proprietary signal types delivered via GCS, API, or flat file.

Talk to Our Data Team
Daniel Wiener

Daniel Wiener

Oracle and USC Alum, Building the ChatGPT for Sales.

View on LinkedIn →

Ready to Transform Your Outreach?

See how Autobound uses AI and real-time signals to generate hyper-personalized emails at scale.