Introduction
  • Introduction
  • Coderpad
Data Ingestion
  • Overview of data scientists' work
  • Sources of data & types of data
  • Data pipeline & data lake
Reading Files
  • CSV and XML
  • Working in Parquet, Avro, and ORC
  • Unstructured text and JSON
Calling APIs
  • Working with JSON
  • Making HTTP calls
  • Processing event-based data
Web Scraping
  • Finding an API
  • Working with Beautiful Soup
  • Working with Scrapy
  • Selenium and other considerations
Schemas
  • What are schemas?
  • Working with ontologies
  • Schema validations
Databases
  • Types of databases
  • Hosted and cost of ops
  • Working with relational databases
  • Working with key-value databases
  • Document databases and Graph databases
Troubleshooting Data
  • Troubleshooting
  • Finding Outliers
Data KPIs and Process
  • Design your data
  • KPIs
  • Monitoring
Conclusion
  • Conclusion