Juan Manuel Ortiz de Zarate
Data Scientist and Developer
Currently, Juan is a PhD candidate at the University of Buenos Aires, researching the subjects of AI, NLP, and social networks. He has over a decade of professional development experience under his belt. For the last few years, he’s been immersing himself in various types of data science projects and loving every minute of it. Juan relishes taking on data problems, building prediction models, and learning state-of-the-art techniques.
Portfolio
Experience
Data Science - 5 years
Machine Learning - 4 years
Python 3 - 3 years
RStudio - 3 years
Jupyter Notebook - 3 years
Pandas - 2 years
Matplotlib - 2 years
Scikit-learn - 2 years
Availability
Preferred Environment
RStudio, Jupyter Notebook
The most amazing...
...thing I've coded is an entire web app to monitor social networks. It has a back end in R for statistics and a PHP front end.
Work Experience
Senior Data Scientist
Fundar
- Managed and supervised external students in their research.
- Managed different kinds of research projects with internal and external members.
- Interviewed external collaborators and consultants to hire for specific projects and technical work.
Ph.D. Student Researcher
Universidad de Buenos Aires
- Created new techniques to analyze discussions on social networks with R and Python.
- Predicted movie reviews using the IMDb database with R.
- Predicted implication clauses with NLP through Python models.
- Developed new techniques to predict controversy with NLP techniques on social networks with R and Python.
- Created new techniques to graph clusters with NLP techniques on social networks with Python and R.
Freelance Data Scientist Advisor
Massomedia S.A.
- Predicted presidential votes via telephone surveys using Python.
- Analyzed several discussions on Twitter and Facebook using R.
- Developed a web app using R, PHP, and MySQL to monitor social networks and media.
- Built a product with Python that could analyze telephone surveys about product sales.
- Presented the results, conclusions, and explanations for each task to the client.
R Specialist
Beryl Capital Management LLC
- Configured and installed a script to download and analyze data from brokers.
- Repaired broken R libraries to fix the financial script.
- Connected to the client's computer and chatted in real-time to test the script and apply all the fixes.
Front-end Developer
Nixtla Inc.
- Developed new and custom plugins in TypeScript for JupyterLab.
- Researched JupyterLab documentation to understand how to create new features.
- Documented the creation of the new features and how they can be extended and installed in new environments.
Technical Editor
Auth0
- Corrected and edited technical articles about implementations of the latest technologies.
- Tested technological implementations described in the articles.
- Evaluated writers' applications from all over the world for the company's writers program.
Head Teaching Assistant
University of Buenos Aires
- Served as head teaching assistant for the Data Organization subject, which introduces students to data science and is a mandatory part of the informatics engineering degree.
- Designed all of the subject content together with another professor. Due to the COVID-19 lockdown, all of the online classes (in Spanish) are available on YouTube; if you would like to view them, contact me for the link.
- Taught half of the theoretical classes and half of the practical lessons. Also coordinated the teachers of the practical lessons and prepared and graded the final exams and practical assignments.
- Designed the final test and the practical assignment that students must pass to complete the subject.
Data Scientist
Carrie Beam Consulting
- Created new functionality in R that, given a graph of financial relations, suggests new connections that would improve trade between the actors.
- Optimized algorithms on graphs so that they can work on large data sets.
- Found bugs in existing R code and suggested fixes.
Teaching Assistant
Universidad Católica Argentina
- Composed practical lessons for the Data Science class.
- Corrected exams and practical homework for the Data Science subject.
- Tutored and answered questions from students in the Data Science topics.
- Explained machine learning techniques, regularization methods, feature extraction and selection, data visualization methods, and other topics related to data science.
Statistical Developer
Decentral Park Advisors LLC (via Toptal)
- Predicted Bitcoin 1D, 3D, and 7D returns with multinomial regressions and generalized additive models (GAMs).
- Predicted whether Bitcoin 1D, 3D, and 7D returns would be positive or negative using classification models such as randomForest, XGBoost, and bagging.
- Reported the results through Jupyter Notebooks and R Shiny dynamic graphs.
Data Analyst
LL Media, LLC (via Toptal)
- Standardized multiple information sources about leads using Python and Pandas.
- Scored the performance of each source over different kinds of campaigns using Python, Pandas, and Matplotlib.
- Predicted good leads from demographic data using scikit-learn machine learning classifiers.
- Predicted bad leads from demographic data using scikit-learn machine learning classifiers.
- Analyzed lead data to find simple correlations between good and bad lead performance.
Teaching Assistant
Universidad de Buenos Aires
- Composed practical lessons for the Computer Structures 1 class.
- Corrected exams and practical homework for the Computer Structures 1 class.
- Tutored and answered questions from students in the Computer Structures 1 class.
Teaching Assistant
Universidad de Buenos Aires
- Composed and gave practical lessons for the Network Theory class.
- Corrected exams and homework for the Network Theory class.
- Tutored and answered questions from students in the Network Theory class.
Senior Full-stack Developer
Telam
- Developed and maintained a system for journalists, which allowed them to write different types of notes and publish them on the news site.
- Built and maintained a system to manage advertising revenue.
- Developed a REST API to connect with other media news sites.
Senior Full-stack Developer
Intraway
- Built and maintained a system for call center assistance.
- Developed and maintained features to communicate with the STB Systems and reset them.
- Constructed and supported a dynamic decision tree to give the best answers to clients based on their specific problems and configurations.
- Created and maintained an internal ticket system to organize tasks and assign them to different teams.
Principal Developer
Imprek
- Developed the company's administrative system.
- Built a system to manage technical services.
- Maintained the stock system.
- Implemented a system to print digital photos from a Kodak machine.
- Developed the company's financial system.
Experience
Data scientist at Fundar
Professor of Data Science at University of Buenos Aires
https://orga-de-datos.github.io/
I was responsible for giving theoretical and practical classes, designing and grading midterms, and administering final exams. Due to the COVID-19 lockdown, all of the online classes (in Spanish) are available on YouTube; if you would like to view them, contact me for the link.
Technical Editor
https://auth0.com/blog/
I also test the applications or code that the authors develop in their articles to check that they are well designed and implemented.
Social Network Analysis in R and Gephi: Digging Into Twitter
https://www.toptal.com/r/social-network-analysis-in-r-gephi-tutorial
Understanding Twitter Dynamics With R and Gephi: Text Analysis and Centrality
https://www.toptal.com/r/social-network-analysis-in-r-gephi-2
Ensemble Methods: The Kaggle Machine Learning Champion
https://www.toptal.com/machine-learning/ensemble-methods-kaggle-machine-learn
A Glimpse Into the Future of Data Science
https://www.pangea.ai/data-science-resources/future-of-data-science/
In this article, I describe the emergence of data science in our lives, how we got here, some representative cases, and where we are going.
10 Best Data Science Development Frameworks to Use in 2021
Predicting Presidential Elections Through Surveys
Using clustering and machine learning techniques, I identified which political cluster each undecided respondent was closest to. With that information, I could predict their vote with high accuracy.
Application to Monitor Social Networks
It has a back end in R that downloads information, processes it, and calculates statistics, and a PHP front end that displays the data. I also used C to manage sessions and to create, delete, and modify searches, among other tasks.
Stars Realigned: Improving the IMDb Rating System
https://www.toptal.com/data-science/improving-imdb-rating-system
In this article, I show you how to refine IMDb scores and create a better ranking system through data science and machine learning techniques.
Predicting Kindle Reviews
Hiring Data Scientists — Best Practices and Job Description Template
In this article, I advise you on how to improve your candidate search and hire the best profiles for your team.
Stars Realigned: Improving the IMDb Rating System
Ensemble Methods: The Kaggle Machine Learning Champion
Social Network Analysis in R and Gephi: Digging Into Twitter
Understanding Twitter Dynamics With R and Gephi: Text Analysis and Centrality
Mining for Twitter Clusters: Social Network Analysis With R and Gephi
Advantages of AI: Using GPT and Diffusion Models for Image Generation
Skills
Languages
PHP 7, Python 3, R, SQL, JavaScript, CSS, Python, PHP, TypeScript, HTML
Frameworks
RStudio Shiny, CodeIgniter, .NET
Libraries/APIs
igraph, Scikit-learn, Matplotlib, Pandas, Keras, Ggplot2, NumPy, jQuery, Caret, Bloomberg API
Paradigms
Data Science, Test-driven Development (TDD)
Platforms
RStudio, Jupyter Notebook, Linux, WordPress, Oracle, Gephi, SharePoint, Bloomberg Terminal
Storage
MySQL
Other
Data Visualization, Charts, Social Networks, Social Network Analysis, Visualization Tools, OOP Designs, Machine Learning, Data Analytics, Data Analysis, Big Data, Time Series, Time Series Analysis, Clustering, Statistics, Code Review, Source Code Review, Team Management, Interviewing, Network Theory, Computer, Structure, Education, University Teaching, Stock Market, Stock Trading, Graphs, Networks, Computer Science, Writing & Editing, Hiring, Technical Writing, Authentication, Authorization, Back-end, Front-end, Data Engineering, Forecasting, JupyterLab, Natural Language Processing (NLP), Artificial Intelligence (AI), Blogging, Blog Posting, Technical Hiring, Financial Data, GPT, Generative Pre-trained Transformers (GPT)
Tools
Dplyr, Seaborn, Jupyter, Auth0, R Studio, Bloomberg
Education
Ph.D. Degree (in Progress) in Computer Science
Universidad de Buenos Aires - Buenos Aires, Argentina
Master's Degree in Computer Science
Universidad de Buenos Aires - Buenos Aires, Argentina
Certifications
Deep Learning
Coursera
Machine Learning
Coursera
Laboratory of Machine Learning
ITBA | Instituto Tecnológico de Buenos Aires