jeorgesilva/README.md

👋 Hi, I’m Jeorge

👨‍🏫 About Me

I’m a multilingual linguist transitioning into Data Science & NLP.
With a background in education, communication, and text analysis, I bring human-centered thinking into machine learning and data work. I’m especially interested in corpus curation, NLP, ethical AI, and accessibility through language technologies.

I’m currently training in Data Science & AI at WBS Coding School (Germany) while building portfolio projects with Python, SQL, and linguistic data.


🧠 Skills

🔤 Programming & Querying

  • 🐍 Python
  • 📊 SQL
  • Java (foundational)

🧩 Data & NLP Concepts

  • Data Cleaning, EDA & Feature Engineering
  • Statistical Analysis & Experiment Design (basics)
  • NLP Fundamentals: text preprocessing, tokenization, labeling
  • Corpus Review & Linguistic Data Annotation
  • Model Evaluation & Error Analysis (introductory)

🛠️ Libraries & Frameworks

  • Data & ML: Pandas, NumPy, Scikit-learn
  • Visualization: Matplotlib, Seaborn, Tableau, Looker Studio
  • Web & Data Access: Requests, BeautifulSoup, SQLAlchemy
  • Backend: Django (basics)
  • NLP: Hugging Face ecosystem (introductory)

☁️ Cloud & Tooling

  • Google Cloud Platform (basics)
  • Microsoft Azure (basics)
  • Git & GitHub (collaboration, forks, PRs)
  • Conda, Jupyter Notebook
  • VS Code, Eclipse

🗄️ Databases

  • Relational Databases (SQL)
  • NoSQL: MongoDB (basics)

🧠 Transferable Skills (from Linguistics & Education)

  • Structured thinking & analytical reasoning
  • Clear communication of complex concepts
  • Attention to detail (language, annotation, evaluation)
  • User-centered and inclusive perspective

🖥️ Operating Systems

  • Windows, macOS

🌐 Networking Basics

  • TCP/IP (introductory)

🤝 Soft Skills

  • Team collaboration
  • Empathy & clear communication
  • Analytical & structured thinking
  • Problem solving
  • Initiative & self-learning

📌 Projects

A user-friendly book lending system designed to track borrowed books, return dates, and overdue items.
Focus on practical problem-solving, data modeling, and user-oriented design.

Basic online banking tool featuring user registration, account management, and transfers. Focus on Python fundamentals, data structures, and clean code practices.

Data analysis of a Brazilian e-commerce dataset, including logistics evaluation, revenue insights, operational reliability, and product value performance using SQL.

🧩 Open Source Exploration & Contributions

Production-ready template for structuring Generative AI projects.
Used to study scalable project architecture, prompt workflows, and best practices for GenAI development.

📈 Yellowbrick (Forked)

Visual analysis and diagnostic tools for machine learning model selection.
Exploring model evaluation, performance visualization, and interpretability techniques.

Machine learning pipeline for detecting fake news.
Used to explore text preprocessing, feature extraction, and supervised learning for NLP tasks.


🧪 Work in Progress

  • 🔤 Corpus Review Contributions
    Reviewing and correcting linguistic dataset entries, focusing on metadata quality and consistency
    🔗 Mozilla Common Voice

🌍 Languages

  • 🇧🇷 Portuguese – Native
  • 🇬🇧 English – C1
  • 🇩🇪 German – B2
  • 🇪🇸 Spanish – C1

🎯 My goal is to help build responsible and inclusive language technologies that empower people through data and AI.
