Howdy Logo
Image of Jose P.

Jose P.
Data Scientist

Spark
Al
Python
Postgresql
Mysql
Mongodb
Amazon Aws
Bio

A Data Scientist and Engineer skilled in mathematics, statistics, and computer science with several years of experience in data analysis projects. Responsible for leading the entire process of Data Mining, Modeling, and Business Analytics/Science, contributing significantly to the improvement of analyses and the development of efficient methods to ensure high-quality results. Possesses experience in data engineering, including acquiring information and implementing data ingestion pipelines to support Business Analytics/Science projects.

  • Senior Data Scientist
    5/1/2023 - Present

    Developed expertise in data extraction, transformation, and ingestion (ETL) processes while working as a Data Scientist/Engineer in the banking sector. Gained proficiency in Python and PySpark for handling large-scale data processing and analysis. Demonstrated extensive knowledge in managing and querying databases using PostgreSQL, DB2, and Oracle.

  • Data Scientist and Data Engineer
    8/1/2022 - 3/1/2023

    Contributed to a B2B and Global Marketing team by extracting and manipulating large-scale, high-volume databases using PySpark. Conducted the entire ETL process, data mining, modeling, and business analytics, providing insights into operations performed by millions of customers across various sales segments. Supported business teams in making informed strategic decisions based on reliable data. Demonstrated expertise in AWS Cloud solutions, including the development of data ingestion pipelines and extraction of data via APIs such as Salesforce and SolucX. Utilized mathematical concepts such as statistical tests and distributions, and worked with an array of programming languages and tools including Python, PySpark, Glue, Lambda, Databricks, AWS S3, SQL Server, and Gitlab. Recognized patterns in data and detected consumption profiles by geographic region through modeling techniques such as Logistic Regression, Bayesian Learning, Random Forest, and Light-GBM. Transformed large amounts of unstructured data into usable formats and managed the deployment and updating of dashboards.

  • Data Analyst
    12/1/2021 - 8/1/2022

    Part of the Data Analytics team supporting the development of data analysis in the innovation ecosystem, focusing on mapping Open Innovation programs of large Brazilian companies with startups to understand their partnerships and innovation endeavors. Utilized computational tools for data analysis, non-relational database manipulation, and advanced analytics techniques. Developed a thorough documentation of the company's database, detailing collections within MongoDB. Modeled data employing Logistic Regression, Bayesian Learning, and Random Forest techniques, analyzing innovation demands of large client companies. Emphasized storytelling through insights and key performance indicators (KPIs) and implemented and maintained dashboards. Demonstrated technical proficiency with programming languages and tools including No-SQL, SQL, Python, and MongoDB.

  • Applied and Computational Mathematics at Federal University of Sergipe
    2013 - 2017

  • Computational Modeling at Federal University of Paraíba
    2018 - 2020

  • SQL for Data Science at dnc.group
    2/1/2022

  • Scrum Fundamentals at dnc.group
    2/1/2022

  • Python Zero at dnc.group
    1/1/2022

  • Big Data - Business at Semantix
    12/1/2021

  • Journey to the Legendary Dashboard at Mamba Treinamentos
    10/1/2021

Jose is available for hire

Hire Jose P.
Check icon

All Howdy Candidates are vetted for skills and english proficiency.