Presales Skill Analysis Platform
Presales Skill Analysis Platform
Built a Python, PostgreSQL, and Streamlit analytics workflow to identify the most common skills, tools, and hiring signals in junior solutions engineering and presales roles in Ireland.
Overview
I built this project to answer a practical question: what do companies actually look for in junior solutions engineers, presales engineers, and solutions consultants?
Job descriptions for these roles are spread across LinkedIn, company career pages, and niche job boards, and the signal is noisy. Titles vary, requirements are inconsistent, and many job boards limit scraping. Instead of treating this as a pure data exercise, I approached it from a solutions engineering angle: define the user problem, design a practical workflow, build a reliable system, and surface the output in a way that non-technical stakeholders can use.
The Problem
Candidates targeting junior solutions engineering roles often do not have a clear, evidence-based view of which technical skills appear most often, which cloud and SaaS concepts are expected at entry level, and how requirements vary across companies and role titles.
- which technical skills appear most often
- which cloud and SaaS concepts are actually expected at entry level
- how requirements vary across companies and role titles
My Approach
I designed the project around a manual-first ingestion model. Instead of scraping job boards, I created a Streamlit intake app where a user can paste a job description and core metadata such as title, company, source, URL, and location.
That design choice solved a few problems at once: it reduced legal and technical risk from scraping restrictions, improved data quality through structured intake, preserved raw descriptions for later parsing, and aligned the system with realistic analyst workflows.
What I Built
- A PostgreSQL schema for companies, job postings, normalized skills, job-skill relationships, analysis questions, results, and reports
- A Docker Compose setup for reproducible local database setup
- A Streamlit intake app that writes job postings directly into PostgreSQL
- Validation rules for required fields and minimum job description length
- A Streamlit dashboard for filtering postings by company and inspecting job details
- Supporting architecture and discovery artifacts documenting the solution for non-technical stakeholders
Tech Stack
Python, PostgreSQL, SQL, Streamlit, Docker, psycopg / psycopg2, pandas
Solutions Engineering Relevance
This project is especially relevant to junior solutions engineer roles because it demonstrates more than coding. It shows how I think through ambiguous business problems and turn them into usable technical systems.
The project demonstrates discovery and problem framing, translating business questions into a technical design, designing a relational data model around reporting needs, building internal tools for data intake, and making pragmatic tradeoffs around compliance, maintainability, and usability.
Key Technical Decisions
1. Manual-first ingestion instead of scraping
This was the most important product decision in the project. Many job boards restrict automated scraping, and for a junior role dataset, a smaller but higher quality process was the better tradeoff.
2. Preserve raw data, derive structure later
The schema stores the full raw description alongside normalized attributes. That means parsing logic can evolve without losing source data, which is important for explainability and future reprocessing.
3. Build the demo layer early
I added a Streamlit dashboard so the project was not just a backend exercise. That made it easier to inspect the dataset, demonstrate the workflow, and show how the system could be used by a recruiter, hiring manager, or analyst.
Outcome
The project established a working foundation for a job market intelligence tool focused on junior solutions engineering roles. It supports structured intake, reproducible local deployment with Docker, persistent storage in PostgreSQL, and interactive exploration through a dashboard.
What I Would Build Next
- Implement parsing and normalization modules for skill extraction and requirement classification
- Expose analytics through FastAPI endpoints
- Add charts for top skills, cloud platform demand, and essential vs desirable requirements
- Generate reusable summary reports for weekly or monthly market snapshots
Github Link
The full codebase for this project is available on GitHub:
https://github.com/JayRua/presales-skill-analysisProject information
- Category: AI Applications
- Project date: April, 2026
- Project URL: https://github.com/JayRua/job-market-intelligence