Software Engineer by training, AI/ML engineer by heart | ex-IBM | AI & NLP & Data Viz
I'm a data-driven problem solver with a passion for building intelligent systems that (sometimes) make the world a better place. Software engineer by training and data scientist by heart.
With a solid background in machine learning, data analysis, and software engineering, I specialize in developing data-intensive applications that deliver measurable results.
As a data engineer, I can build robust software to mine, cleanse, wrangle, integrate, and store data in a database. I also have experience in business intelligence, analytics, and data visualization.
As a data scientist, I've applied my skills to a wide range of projects, from natural language processing to computer vision. I'm always eager to take on new challenges and explore cutting-edge technologies that can drive innovation and growth.
Whether you're looking for a team player who can lead by example, a creative thinker who can turn data into actionable insights, or a software developer who can turn ideas into reality, I'm the right candidate for the job.
Flask, Celery, Apache Kafka, Node.js, Firebase, Apache Spark, Django, Express
LLMs, NLP, Embeddings, Vector Databases, Tokens, LLMOps, RAG, GPUs, GPTs, FMs, LMM, GAN, Fine-Tuning, LoRA, Prompts, Alignment
Pandas, NumPy, Scikit-learn, Bokeh, Matplotlib, Tableau, Apache Airflow, MLflow, Excel, PyTorch, SQL, SageMaker, TensorFlow, D3.js, Kubernetes, AWS, IBM Cloud
AWS Redshift, S3, DynamoDB, Redis, MongoDB, Solidity, ArduPilot, A/B Testing, Google Analytics, Adobe Analytics, Git
Follow me on Medium: https://medium.com/@tom_t
Consulting in the domain of GenAI and LLMOps, delivering custom AI app development services with a focus on advanced language models. I provide strategic consulting and hands-on development, transforming ideas into feasible solutions. My expertise includes:
- AI Strategy & LLMOps: Advising on AI integration and operational strategies, ensuring scalable and efficient implementation of language models in business processes.
- Custom AI Applications: Developing bespoke AI solutions, from a GPT-based data-tagging API to advanced RAG systems leveraging multiple fine-tuned LLM agents.
- Proof of Concept (PoC) Development: Spearheading the development of PoCs to demonstrate the practicality and potential impact of AI solutions in real-world scenarios.
My role as a consultant goes beyond technical development and advisory work; I am a strategic partner to my clients, helping them navigate the complex landscape of AI technology to build AI-powered tools that solve real problems.
DoiT International · Full-time
- Delivered and maintained critical ETL pipelines processing ~10 TB of AWS billing data per month (SQL, Apache Beam, NumPy)
- Proposed and led the implementation of customized internal dashboards and admin panels that automated roughly half an FTE of manual work and are used daily by a team of analysts (Plotly, Dash, Streamlit, FastAPI)
- Designed, implemented, and monitored all stages of MLOps for the cloud usage forecasting pipeline. Used an ensemble of ARIMA and DNN to make daily forecasts for over 2,000 customers, which improved the profitability of the entire core by at least 10% (Kubeflow, Docker, Scikit-learn, PyTorch)
- Led the team to win the company-wide Hackathon with an innovative optimization idea that will save our customer ~$800k in a year
- Used Streamlit to build a pipeline monitoring dashboard, which became the default ETL debugging and troubleshooting tool within my team
DoiT International, a strategic partner of GCP and AWS, tackles complex problems of scale for our customers, using our expertise in resolving problems, coding, algorithms, complexity analysis, and system design.
- Developed a Topic Detection and Tracking method combining NLP and deep learning with unsupervised document clustering
- Implemented a dozen Slack bots in JavaScript and AWS Lambda delivering data-driven insights to editorial teams
While working full-time as a grad student, I finished and defended my master's thesis with the highest grade.
Master's Thesis topic: "Design and evaluation of document embedding-based topic detection and tracking system for news articles"
My main project was called "Machine Learning-based Ransomware detection for Cyber Resiliency".
• Successfully delivered anomaly detection ML models in Python for a cybersecurity application, detecting 94% of anomalies.
• Contributed to the design and implementation of an ML product for IBM's Recovery Orchestration.
• Researched and implemented a data generation pipeline using a malware sandbox (Cuckoo) to collect data from over 200 ransomware strains, including CryptoLocker, Jigsaw, and WannaCry.
• Led the design of the PoC based on an evaluation of state-of-the-art research in the fields of Cyber Security, Cyber Resiliency, Anomaly Detection, and AI.
Technologies and stack: Scikit-learn, PyTorch, Redis / ActiveMQ, Docker, MariaDB, MinIO, Pandas, Dask, Kubernetes, IBM Cloud + Watson, Cuckoo, VMware
Digitas Pixelpark · Work Study
Technical:
- Successfully delivered a CMS application built with Node.js, Angular, and MongoDB to serve customized analytics dashboards to clients
- Implemented over 20 ETL pipelines in Apache Airflow and PySpark
- Managed AWS architecture and carried out DevOps jobs with CloudFormation, IAM, and ECS
- Developed three voice UIs and Amazon Alexa Skills for internal use
Analytics:
- Prepared custom Tableau dashboards for brands such as Mercedes-Benz, McDonald's, and BMW
- Provided functional support for data analysts with MongoDB/SQL queries, Google Analytics, GTM, and A/B testing with Optimizely
- Built over 10 Tableau analytics dashboards provided to clients
Stack: Tableau Desktop, Tableau Server, Apache Airflow, Pandas, Flask, Amazon S3 + Redshift, SageMaker, AWS Lambda
Team Presentation
Grade: GPA 3.7 / Endnote (German final grade) 1.3
Joint degree between Humboldt, Technische, and Freie Universität Berlin.
Coursework: Machine Learning, Data Visualisation, Data Integration, Big Data Analytics, Data-driven Business, Enterprise Computing
Master Thesis
Activities and societies:
- Mentored younger Medieninformatik colleagues as part of the faculty-wide program
Joint degree between Technische and Freie Universität Berlin.
Coursework: Algorithms and Data Structures, Web Technologies, Linear Algebra, Advanced Multivariate Calculus, Statistics, Media Science
Bachelor Thesis: "Design, Development, and Evaluation of a Voice User Interface for Queries on semistructured Data"
This profile is based on publicly available information. Tomasz is not affiliated with or endorsed by Clera.