Spencer Klug
Skills/Technologies
dbt | AWS | Databricks | Terraform | CI/CD | SQL | Python | Airflow | Power BI | Snowflake | Docker | Kubernetes | FastAPI
Experience
Sr. Data Engineer
October 2024 - Present | Funko
- Led the data modeling & delivery of the Customer Data Platform driving a targeted ~$10 million in additional sales.
- Created a custom replication pipeline to ingest over 4 trillion records from a 2012 MS SQL instance.
- Built a replication pipeline for Funko's product lifecycle management tool based on its XML endpoint.
- Stood up a modern data architecture leveraging dbt, Snowflake, MWAA Airflow, and AWS.
- Built core business-facing data models including orders, products, customers, etc.
- Implemented Type 2 slowly changing dimensions to preserve history for key data models such as products.
- Set up infrastructure-as-code (IaC) with Terraform to manage Snowflake, automating provisioning of roles, permissions, etc.
- Built CI/CD pipelines across the IaC, dbt, and MWAA Airflow repositories to enforce our style guide, testing, and deployments.
- Owned and operated Snowflake including reporting out on metrics such as data quality, accuracy and timeliness.
- Performed the majority of code reviews for the team, providing feedback to junior engineers.
Data Engineering & Data Intelligence, Team Lead
January 2023 - July 2024 | PitchBook Data
- Led a team of 3 Data Engineers and 13 Data Intelligence analysts supporting over 1,000 stakeholders.
- Owned the integration of data from PitchBook's $600 million acquisition of LCD into the PitchBook product.
- Created a custom Customer Data Platform that enriched inbound leads using PitchBook's unique research process, resulting in a 3% increase in the conversion rate of inbound leads into demos.
- Integrated third-party streaming APIs into PitchBook's core product using Snowflake's Snowpipe functionality.
- Standardized teams' development lifecycle including Product Review Documents and Feature Documents.
- Owned reporting on the team's roadmap and KPIs to the organization and executive leadership.
- Established, documented & reported out on Service Level Agreements (SLAs) with stakeholders.
Data Engineer
May 2022 - January 2023 | PitchBook Data
- Created customer pipelines from third-party data sources such as government business registries and data providers.
- Designed core dbt models, creating a single source of truth for key reporting metrics utilized across the business.
- Built a classifier using XGBoost to filter thousands of daily inbound regulatory filings (PDFs) based on relevance.
- Used FastAPI and Kubernetes to create and manage microservices that provided entity resolution, classification, etc.
- Managed Snowflake and AWS environments via Terraform.
- Designed core internal packages to help provide abstraction & common utilities across the team.
Data Operations Manager, Valuations & Private Financials
October 2018 - May 2022 | PitchBook Data
- One of 5 finalists for "Drive and Embrace Change" award, a yearly nomination process across nearly 2,000 employees.
- Launched and operationalized a team of 10 to extract key financial values from publicly available regulatory documents.
- Created data pipelines from government registries, increasing output per employee from 22 to 30 valuations per day.
- Owned & reported out on key performance indicators to leadership on a monthly basis and the CEO on a quarterly basis.
- Increased yearly team output by 336% from 2018 to 2021 through process standardization & automation.
- Built core BI reporting to track key performance indicators such as work order yield, productivity, and SLAs.
- Improved work order timeliness from 165 hours to 27 hours, with 99.6% of work orders meeting the 48-hour service-level agreement.
Education
Bachelor of Business Administration
Finance and Economics Majors
Gonzaga University, Spokane, WA