This professional boasts extensive expertise in information technology, specializing in support, maintenance, development, and integration of both structured and unstructured data sources. Proficient in database modeling using OLAP and OLAP with DBMS platforms such as PostgreSQL, Oracle, MySQL, and NoSQL, they demonstrate comprehensive skills in data handling. Their expertise spans the entire data analysis pipeline, from collection to transformation, processing, cleansing, and loading (ETL), utilizing sophisticated tools including Python, Spark, Pentaho, R, and Hadoop. They are adept at version control with GitHub and GitLab on both Windows and Linux operating systems.
In the domain of data science, their skills include implementing machine learning algorithms for classification and regression using scikit-learn. They are also proficient in natural language processing techniques such as bag of words, CBOW, skip-gram, fastText, and TF-IDF. Programming capabilities encompass SQL, PL/SQL, PG/SQL, Python, PySpark, and R, leveraging development tools like Data Modeler, Pentaho Data Integration, VisualCode, PyCharm, GitBash, GitHub, DBeaver, robot 3T, Hadoop, Jupyter Notebook, and RStudio.
Their experience with database management systems includes PostgreSQL, MySQL, Oracle, and MongoDB, alongside robust competencies in Linux operating systems (Red Hat, Debian, CentOS) and various Windows versions (XP, 2000, 2003, 7). Their multifaceted IT knowledge positions them to effectively contribute to projects involving support, maintenance, development, integration of data sources, and data analysis, as well as database modeling and version control within Windows and Linux environments.