Juan Jose Bonilla
Data Architect
Bogotá, Bogota, Colombia
6+ Years Exp
Summary
Juan is a Data Architect with expertise in Docker, Machine Learning, Jenkins, and Terraform technologies. He has led transformative initiatives, including migrating on-premises data to an AWS data lake, significantly enhancing data management and accessibility. Juan has successfully constructed a lakehouse solution using various AWS services, including S3, Redshift, Step Functions, Event Bridge, Kinesis, Glue, and Athena. He has implemented Python automation to update warehouse data, optimizing data mart creation to enhance the sources and speed of the dashboards. Juan has demonstrated proficiency in various tools and platforms, optimizing data migration, machine learning, and DAG deployments. Juan is an invaluable Data Architect with experience in orchestrating data processes across various AWS services and enhancing report generation speed and efficiency. His skills include implementing machine learning pipelines for cancel rate reduction, effectively leveraging AWS SageMaker, and improving operational efficiency. His keen eye for data organization and supply chain operations, devising strategic sales plans, and leading high-performing teams underscores his multifaceted skill set.
Technical Skills
Detailed View
Work Experience
Data Architect
Avenue Code (Contractor)
Full Time | 01/09/2022 - Present
Colombia
- Spearheaded the implementation of a data mesh architecture and successfully migrated on-premises data to an AWS data lake, enabling more efficient data management and accessibility. Contributed to the company's inaugural migration of analytical data and services to the cloud, enhancing scalability and cost-effectiveness.
- Excelled in training and mentoring junior engineers in the field of data architecture, sharing extensive knowledge and expertise to foster their professional growth.
- Implemented a Sagemaker pipeline to scale up machine learning inferences, resulting in a substantial increase of 300% in machine learning inference output, which significantly improved data analysis capabilities.
- Utilized a range of tools, including Lakeformation, Terraform, Jenkins, Docker, Sagemaker (pipelines, endpoints, studio), and AWS DataSync, to streamline and optimize data migration and machine learning processes.
- Ensured seamless ETL data flow, enabling the consolidation of data from telecommunications, proptech, and insurance domains into a unified, accessible format.
Data Architect
La Haus (Contractor
Full Time | 28/10/2021 - 06/04/2022
Colombia
- Implemented Kubernetes for Airflow deployment, resulting in a remarkable reduction of overall Directed Acyclic Graph (DAG) failures to just 3%. Additionally, the DAG deployment time was cut by 30%, significantly enhancing workflow efficiency.
- Led the migration of the data model to data-dbt and established Continuous Integration/Continuous Deployment (CICD) flows for migrating existing warehouse models to data dbt on Airflow. This migration eliminated erroneous data presentations through the enforcement of data quality checks, ensuring accurate and reliable data.
- Designed and conducted comprehensive training programs, ensuring that junior engineers acquired a deep understanding of data architecture concepts, best practices, and tools.
- Designed and implemented ETL processes for a comprehensive lakehouse architecture using AWS Glue and Redshift/Snowflake. These processes involved extracting data from diverse sources, transforming it to meet business needs, and loading it into the lakehouse.
- Optimized the lakehouse's cost by leveraging S3 storage lens and Snowflake monitors/reports. Collaborated with operations and sales teams to determine the ideal update frequency of reports, leading to more efficient resource allocation. Consulted on compute cost optimization across different departments based on query usage, resulting in a 20% reduction in data warehouse costs and a 15% improvement in query performance speed.
- Utilized a range of tools, including EKS (Elastic Kubernetes Service), Docker, CodePipeline, GitHub Actions, Datadog, Terraform, Data dbt, and Open Metadata, to orchestrate these data management and optimization processes effectively.
Lead Data Architect
E2E Nearshore
Full Time | 01/11/2019 - 01/09/2021
Colombia
- Successfully constructed a lakehouse solution using various AWS services, including S3, Redshift, Step Functions, Event Bridge, Kinesis, Glue, and Athena. This solution significantly improved report generation speed, achieving a remarkable 300% enhancement, and reduced the order processing time by 10% through effective system integrations.
- Implemented machine learning service pipelines to reduce the cancel rate using AWS SageMaker, leveraging probabilistic models and fraud detection techniques. This initiative led to a notable 7% reduction in the cancel rate, enhancing the overall operational efficiency.
- Employed a suite of tools, including Glue, Athena, Redshift, Power BI, Kinesis, Event Bridge, Lambda, API Gateway, Step Functions, and Docker, to orchestrate and optimize these data and machine learning processes effectively.
Data Specialist
Paleb SAS
Full Time | 01/11/2017 - 01/11/2019
Colombia
- Established a Postgres data warehouse to efficiently organize and manage data from various sources, including inventory, sales, customers, and suppliers. Implemented proactive measures, such as email alerts and dashboards focused on key metrics, resulting in a 10% reduction in inventory spill-over and improved inventory management.
- Deployed a Natural Language Processing (NLP) algorithm to analyze comments on social media platforms, enhancing the company's ability to respond effectively to negative feedback. This initiative led to a significant 40% improvement in addressing and managing negative comments and customer sentiments.
- Utilized a set of tools, including Python, NLP (Natural Language Processing), Tableau, Postgres, Jupyter, RDS (Relational Database Service), and Scrappy, to execute these data organization and sentiment analysis processes effectively.
BI Team Leader
Cheil
Full Time | 01/04/2019 - 01/11/2019
Colombia
- Developed dashboards tailored for Samsung's Colombian Digital Appliances (DA) division, incorporating automation for report generation and data retrieval from various business areas such as Sales, Inventory, Flooring, POP, Suppliers, and Product. This initiative led to a substantial 50% reduction in report generation time and a 10% increase in report generation efficiency.
- Implemented Python automation to update warehouse data, optimizing data mart creation to enhance the sources and speed of the dashboards. These efforts resulted in a remarkable 50% improvement in dashboard speed, enabling faster and more data-driven decision-making.
- Leveraged a combination of tools, including Tableau, Python, and Postgres, to effectively orchestrate and streamline these data management and dashboard optimization processes.
C.E.O
Colibri Trading S.A.S.
Full Time | 24/01/2013 - 09/11/2017
Colombia
- Orchestrated and optimized the company's supply chain operations to ensure a smooth and efficient flow of goods and materials, enhancing overall productivity and cost-effectiveness.
- Strategically devised and executed sales plans to meet revenue targets and maximize profitability, while closely monitoring market dynamics and customer demands.
- Led and managed a high-performing sales team, fostering a collaborative and results-driven environment, resulting in increased sales and customer satisfaction.
- Designed and implemented impactful marketing campaigns that effectively promoted the company's products or services, driving brand awareness and customer engagement while meeting campaign objectives.
Research Assistant
EAFIT University
Full Time | 19/01/2012 - 04/07/2012
Colombia
- Conducted comprehensive performance briefings of Eafit's graduates, providing insights into their achievements and contributions in their respective fields.
- Organized and executed surveys to gather valuable feedback and data related to Eafit's graduates' experiences, helping to improve the educational programs and services.
- Evaluated graduates' performance using econometric models, facilitating data-driven analysis and insights into their academic and professional development, enabling continuous improvement of the educational institution.
Education
Big Data and Machine Learning Specialist, Diploma
BSG Institute
Economics, Bachelor of Science
EAFIT University