Available for work
Faruk Kamçıcı
~/farukkamcici%_

Faruk
Kamçıcı

DATA ENGINEER

I design and build data pipelines, model datasets, and ship them as production-ready products.

Istanbul, Türkiye

01

About

22 years old, based in Istanbul. Recently graduated from Yeditepe University with a degree in Management Information Systems.

I come from a software development background but my focus is data and data engineering in particular. I like building things from raw data all the way to a working product: pipelines, models, APIs, and deployment.

I also work on backend systems and applied ML when the project calls for it. Most of my projects go through the full cycle: ingesting data, transforming it with dbt, training models, serving results through FastAPI, and running everything in Docker.

I care about data quality, clean models, and systems that actually hold up in production.

Focus Areas

  • End-to-end data products
  • Pipeline design and warehouse modeling
  • Forecasting and applied ML
  • Geospatial analytics
  • Scalable, production-ready data systems

Also into basketball and finance.


02

Experience

Software Developer Intern

Askipo

July 2024 — January 2025 · Istanbul, Türkiye

  • Built the customer mobile app from scratch as the primary mobile developer using Ionic and TypeScript
  • Contributed to full-stack development of the customer admin panel using Angular and .NET
  • Implemented SQL Server triggers and extended .NET APIs to deliver new features and fix production issues
IonicTypeScriptAngular.NETSQL Server

Education

BSc — Management Information Systems

Yeditepe University, Istanbul, Türkiye

Sept 2021 — Jan 2026 (Graduated) · GPA: 3.30

Erasmus Exchange Program

Warsaw Management University, Warsaw, Poland

2025 · GPA: 3.50


03

Projects

01

doluMu

BSc thesis project: passenger crowding forecasting for Istanbul public transport

  • Built a forecasting system that predicts hourly passenger intensity for Istanbul bus and rail lines over the next 24 hours
  • Converted passenger count forecasts into occupancy metrics by combining predictions with route capacity/trip information
  • Delivered the product as a public, login-free PWA

Data & Modeling

  • ·Cleaned IBB hourly journey data and aggregated it to route-hour level
  • ·Enriched data with calendar signals and hourly weather data from Open-Meteo
  • ·Engineered lag features (24/48/168), rolling statistics, and time-based features
  • ·Trained a global LightGBM model that learns across hundreds of routes
  • ·Tracked and interpreted model behavior with MLflow and SHAP

Product & Deploy

  • ·Scheduled nightly batch forecasting, wrote predictions to database tables
  • ·Exposed ready predictions via FastAPI API
  • ·Ran FastAPI + PostgreSQL in Docker on a remote server behind Nginx with HTTPS
  • ·Built frontend in Next.js with PWA support, caching, and local storage
LightGBMMLflowSHAPPolarsPandasParquetPostgreSQLFastAPIAPSchedulerDockerNginxNext.jsReact LeafletVercel
02

Istanbul360

Interactive geospatial platform for Istanbul open data

  • Built an interactive platform to explore transport, infrastructure, demography, housing, and cultural place data on a map
  • Added district-level comparison views, place exploration, and location-aware discovery features

Data & Modeling

  • ·Ingested CSV/GeoJSON data from IBB Open Data Portal into DuckDB
  • ·Built dbt staging and mart layers combining district population, rent, housing price, and green area metrics
  • ·Linked places to districts with spatial joins and snap adjustments
  • ·Added PostGIS spatial indexes and dbt tests for data quality

Product & Deploy

  • ·Built FastAPI REST endpoints for districts, metrics, and places
  • ·Implemented typo-tolerant search with Elasticsearch fuzzy search and autocomplete
  • ·Built map UI with Next.js + MapLibre with choropleth comparisons, place clustering, layer toggles
  • ·Added RAG assistant: embedded text with HuggingFace, indexed vectors in FAISS, reranked with CrossEncoder, and grounded answers with an LLM
DuckDBdbt CorePostgreSQLPostGISFastAPIElasticsearchHuggingFaceFAISSGemini APINext.jsMapLibreTailwindCSSDocker ComposeDigitalOceanVercel
03

Football Data Warehouse

Football analytics with Transfermarkt data, 2012 to 2024

  • Cleaned Transfermarkt data from Kaggle and loaded it into a DuckDB warehouse
  • Modeled staging and mart layers with dbt, added schema tests
  • Served the data through FastAPI and built visual analytics with Streamlit + Plotly
PolarsDuckDBdbtFastAPIStreamlitPlotlyMakefileDockerRailway
04

E-commerce Behavior Analytics

Interactive dashboard for 100MB+ e-commerce event data

  • Built a layered processing workflow across user, brand, and category dimensions
  • Customer segmentation, loyalty scores, and brand-level cart abandonment analysis
  • Designed the dashboard for decision support with interactive filtering and ad-hoc exploration
PythonPandasNumPyStreamlitPlotlyStreamlit Cloud

04

Skills

Programming

PythonSQLTypeScriptJavaScript

Data Engineering

DuckDBdbt CoreApache AirflowPolarsPandasParquetETL / ELTData ModelingBigQuery

Databases & Search

PostgreSQLElasticsearchFAISSRedis

Backend & APIs

FastAPIREST APIsNginx

ML & AI

LightGBMscikit-learnMLflowSHAPRAGGemini API

Frontend & Data Apps

Next.jsReactStreamlitPlotlyTailwindCSS

Infrastructure

GCPDockerDocker ComposeGitHub ActionsVercelDigitalOcean

05

Get in Touch

I'm open to data engineering roles, freelance projects, and interesting collaborations. Feel free to reach out.