~/faruk-kamcici

Faruk Kamcici

Data Engineer

Building end-to-end data products across data engineering, backend systems, geospatial analytics, and applied machine learning.

Istanbul, Türkiye

01

About

I am a 22-year-old Data Engineer based in Istanbul, recently graduated from Yeditepe University (Management Information Systems, January 2026).

I have a software development background and build end-to-end data products across data engineering, backend, geospatial analytics, and applied machine learning.

My primary career direction is to specialize deeply in Data Engineering. I also stay active in adjacent areas like backend development and applied ML because they strengthen the data products I build.

My projects cover the full lifecycle: ingesting raw files and APIs, designing ETL/ELT workflows, modeling with dbt, training ML models, exposing results through FastAPI services, and deploying production-ready systems with Docker.

I prioritize data quality and operational reliability by adding validation checks, fallback handling, and maintainable data models.

Focus Areas

  • End-to-end data systems and analytics engineering
  • Production APIs for data products
  • Geospatial data processing (PostGIS, map UI, spatial search)
  • Time-series forecasting and feature engineering
  • Applied AI (retrieval, reranking, and assistant workflows)

Outside of work, I closely follow finance and enjoy basketball.


02

Experience

Software Developer Intern

Askipo

July 2024 — January 2025 · Istanbul, Türkiye

  • Built the customer mobile app from scratch as the primary mobile developer using Ionic and TypeScript
  • Contributed to full-stack development of the customer admin panel using Angular and an internal .NET platform
  • Implemented SQL Server triggers and extended .NET APIs to deliver new features and fix production issues
Ionic · TypeScript · Angular · .NET · SQL Server

Education

BSc — Management Information Systems

Yeditepe University, Istanbul

Sept 2021 — Jan 2026 (Graduated) · GPA: 3.30

Erasmus Exchange Program

Warsaw Management University, Warsaw, Poland

2025


03

Skills

Programming

Python · SQL · TypeScript · JavaScript

Data Engineering

DuckDB · dbt Core · Apache Airflow · Polars · Pandas · NumPy · Parquet · ETL / ELT · Data Modeling · Data Validation · BigQuery

Databases & Search

PostgreSQL · PostGIS · SQL Server · Elasticsearch · FAISS · Vector search

Backend & APIs

FastAPI · REST APIs · APScheduler · psycopg2 · JWT · Redis · Nginx

ML & AI

LightGBM · scikit-learn · MLflow · SHAP · SentenceTransformers · CrossEncoder · RAG · Gemini API

Frontend & Data Apps

Next.js · React · PWA · MapLibre GL JS · React Leaflet · Streamlit · Plotly · TailwindCSS · shadcn/ui

Infrastructure

GCP · Docker · Docker Compose · Makefile · GitHub Actions · Vercel · DigitalOcean · Railway

04

Projects

01

doluMu

BSc thesis — passenger crowding forecasting for Istanbul public transport

  • Built a forecasting system that predicts hourly passenger intensity for Istanbul bus and rail lines over the next 24 hours
  • Converted passenger count forecasts into occupancy metrics by combining predictions with route capacity/trip information
  • Delivered the product as a public, login-free PWA
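
The conversion from passenger counts to occupancy can be illustrated with a small sketch. It assumes occupancy is the predicted hourly passenger count divided by the hourly route capacity (trips per hour times vehicle capacity). The function names, the formula, and the level thresholds are hypothetical illustrations, not the project's actual code.

```python
def occupancy_ratio(predicted_passengers: float, trips_per_hour: int, vehicle_capacity: int) -> float:
    """Return predicted passengers as a share of the hourly route capacity."""
    hourly_capacity = trips_per_hour * vehicle_capacity
    if hourly_capacity <= 0:
        # No scheduled service in this hour: report 0.0 instead of dividing by zero
        return 0.0
    return predicted_passengers / hourly_capacity


def occupancy_level(ratio: float) -> str:
    """Map a ratio to a coarse label (thresholds are illustrative assumptions)."""
    if ratio < 0.4:
        return "low"
    if ratio < 0.8:
        return "medium"
    return "high"


ratio = occupancy_ratio(predicted_passengers=540, trips_per_hour=6, vehicle_capacity=100)
print(round(ratio, 2), occupancy_level(ratio))
```

Keeping the ratio and the label separate lets the API return a number while the UI decides how to present it.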

Data & Modeling

  • Cleaned IBB hourly journey data and aggregated it to route-hour level
  • Enriched data with calendar signals and hourly weather data from Open-Meteo
  • Engineered lag features (24/48/168), rolling statistics, and time-based features
  • Trained a global LightGBM model that learns across hundreds of routes
  • Tracked and interpreted model behavior with MLflow and SHAP
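
The lag and rolling features listed above can be sketched without any dependencies. The project itself lists Polars and Pandas, so this plain-Python version is a stand-in rather than the actual pipeline. The function name, the 24-hour rolling window, and the synthetic series are assumptions made for illustration.

```python
from statistics import mean

LAGS = (24, 48, 168)  # hours: previous day, two days back, previous week


def build_features(counts: list, window: int = 24) -> list:
    """Build lag and rolling-mean features for one route's hourly series.

    Rows start at index max(LAGS) so every lag is available, and the rolling
    mean only uses hours strictly before the target hour to avoid leakage.
    """
    rows = []
    for t in range(max(LAGS), len(counts)):
        row = {f"lag_{lag}": counts[t - lag] for lag in LAGS}
        row[f"rolling_mean_{window}"] = mean(counts[t - window:t])
        row["hour_of_day"] = t % 24  # assumes the series starts at midnight
        row["target"] = counts[t]
        rows.append(row)
    return rows


# Synthetic series: 8 days of hourly counts with a simple daily pattern
series = [float(100 + 10 * (h % 24)) for h in range(24 * 8)]
features = build_features(series)
print(len(features), features[0])
```

For a global model across many routes, the same function would be applied per route before stacking the rows, so that lags never cross route boundaries.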

Product & Deploy

  • Scheduled nightly batch forecasting that writes predictions to database tables
  • Served the precomputed predictions through a FastAPI service
  • Ran FastAPI + PostgreSQL in Docker on a remote server behind Nginx with HTTPS
  • Built the frontend in Next.js with PWA support, caching, and local storage
LightGBM · MLflow · SHAP · Polars · Pandas · Parquet · PostgreSQL · FastAPI · APScheduler · Docker · Nginx · Next.js · React Leaflet · Vercel

02

Istanbul360

Interactive geospatial platform for Istanbul open data

  • Built an interactive platform to explore transport, infrastructure, demography, housing, and cultural place data on a map
  • Added district-level comparison views, place exploration, and location-aware discovery features

Data & Modeling

  • Ingested CSV/GeoJSON data from IBB Open Data Portal into DuckDB
  • Built dbt staging and mart layers combining district population, rent, housing price, and green area metrics
  • Linked places to districts with spatial joins and snap adjustments
  • Added PostGIS spatial indexes and dbt tests for data quality
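
The idea of linking places to districts with a snap fallback can be sketched in plain Python. This is a simplified stand-in, not the PostGIS-based implementation: it treats coordinates as planar, uses a ray-casting containment test, and snaps to the nearest polygon vertex rather than the nearest edge. All names and the tolerance value are hypothetical.

```python
from math import hypot
from typing import Optional


def contains(polygon: list, point: tuple) -> bool:
    """Ray-casting point-in-polygon test (points on the boundary may go either way)."""
    x, y = point
    inside = False
    for i in range(len(polygon)):
        (x1, y1), (x2, y2) = polygon[i], polygon[i - 1]
        if (y1 > y) != (y2 > y):
            # x coordinate where this edge crosses the horizontal line through the point
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def assign_district(point: tuple, districts: dict, snap_tolerance: float) -> Optional[str]:
    """Return the containing district, else the nearest one within snap_tolerance."""
    for name, polygon in districts.items():
        if contains(polygon, point):
            return name
    # Snap fallback: nearest polygon vertex within the tolerance (crude approximation)
    best_name, best_dist = None, snap_tolerance
    for name, polygon in districts.items():
        dist = min(hypot(point[0] - vx, point[1] - vy) for vx, vy in polygon)
        if dist <= best_dist:
            best_name, best_dist = name, dist
    return best_name


districts = {
    "A": [(0, 0), (1, 0), (1, 1), (0, 1)],
    "B": [(2, 0), (3, 0), (3, 1), (2, 1)],
}
print(assign_district((0.5, 0.5), districts, snap_tolerance=0.1))
print(assign_district((1.05, 1.0), districts, snap_tolerance=0.1))
```

The snap step is useful for places whose coordinates fall just outside a district boundary, for example on a coastline or a border road.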

Product & Deploy

  • Built FastAPI REST endpoints for districts, metrics, and places
  • Implemented typo-tolerant search with Elasticsearch fuzzy search and autocomplete
  • Built the map UI with Next.js + MapLibre, including choropleth comparisons, place clustering, and layer toggles
  • Added a RAG assistant: embedded the knowledge base with HuggingFace models, indexed it in FAISS, reranked results with a CrossEncoder, and grounded answers via an LLM
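
A typo-tolerant search request of the kind described above might be built as follows. This sketch only constructs the request body for an Elasticsearch `match` query with `fuzziness`; the field name `name`, the `prefix_length` setting, and the helper function are assumptions, and autocomplete is not covered here.

```python
import json


def build_fuzzy_query(text: str, field: str = "name", size: int = 10) -> dict:
    """Build an Elasticsearch search body that tolerates small typos.

    "fuzziness": "AUTO" lets Elasticsearch choose the allowed edit distance
    from the term length. The field name is a hypothetical example.
    """
    return {
        "size": size,
        "query": {
            "match": {
                field: {
                    "query": text,
                    "fuzziness": "AUTO",
                    "prefix_length": 1,  # require the first character to match
                }
            }
        },
    }


body = build_fuzzy_query("galata kulsei")
print(json.dumps(body, indent=2))
```

The resulting dictionary can be passed as the body of a search request by whichever Elasticsearch client the service uses.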
DuckDB · dbt Core · PostgreSQL · PostGIS · FastAPI · Elasticsearch · HuggingFace · FAISS · Gemini API · Next.js · MapLibre · TailwindCSS · Docker Compose · DigitalOcean · Vercel

03

Football Data Warehouse

End-to-end football analytics (Transfermarkt 2012–2024)

  • Downloaded and cleaned Transfermarkt data from Kaggle
  • Loaded transformed data into a single-file DuckDB warehouse
  • Modeled analytics layers with dbt (stg_* and mart_* models + schema tests)
  • Served analytical data through FastAPI and built visual analytics with Streamlit + Plotly
Polars · DuckDB · dbt · FastAPI · Streamlit · Plotly · Makefile · Docker · Railway

04

E-commerce Behavior Analytics

Interactive dashboard for 100MB+ e-commerce event data

  • Built a layered processing workflow across user, brand, and category dimensions
  • Produced customer segmentation, loyalty scores, and brand-level cart abandonment analysis
  • Designed the dashboard for decision support with interactive filtering and ad-hoc exploration
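
Brand-level cart abandonment can be illustrated with a short dependency-free sketch. It assumes each event carries `user_id`, `product_id`, `brand`, and an `event_type` of `cart` or `purchase`, and it defines abandonment as the share of carted (user, product) pairs that were never purchased. Both the schema and the definition are assumptions, and the project itself lists Pandas rather than plain Python.

```python
from collections import defaultdict


def cart_abandonment_by_brand(events: list) -> dict:
    """Return, per brand, the share of carted (user, product) pairs never purchased."""
    carted = defaultdict(set)
    purchased = defaultdict(set)
    for event in events:
        key = (event["user_id"], event["product_id"])
        if event["event_type"] == "cart":
            carted[event["brand"]].add(key)
        elif event["event_type"] == "purchase":
            purchased[event["brand"]].add(key)

    rates = {}
    for brand, cart_keys in carted.items():
        abandoned = cart_keys - purchased[brand]
        rates[brand] = len(abandoned) / len(cart_keys)
    return rates


# Hypothetical sample events
events = [
    {"user_id": 1, "product_id": 10, "brand": "acme", "event_type": "cart"},
    {"user_id": 1, "product_id": 10, "brand": "acme", "event_type": "purchase"},
    {"user_id": 2, "product_id": 10, "brand": "acme", "event_type": "cart"},
    {"user_id": 3, "product_id": 20, "brand": "globex", "event_type": "cart"},
]
print(cart_abandonment_by_brand(events))
```
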
Python · Pandas · NumPy · Streamlit · Plotly · Streamlit Cloud

05

Get in Touch

I'm open to data engineering roles, freelance projects, and interesting collaborations. Feel free to reach out.