Types of Data

  • Structured Data (rows/columns CSV, Excel)

  • Semi-Structured Data (JSON / XML)

  • Unstructured Data (Video, Audio, Document, Email)

Structured Data

Semi-Structured Data

JSON

[
   {
      "id":1,
      "name":"Rachel Green",
      "gender":"F",
      "series":"Friends"
   },
   {
      "id":"2",
      "name":"Sheldon Cooper",
      "gender":"M",
      "series":"BBT"
   }
]

XML

<?xml version="1.0" encoding="UTF-8"?>
<actors>
   <actor>
      <id>1</id>
      <name>Rachel Green</name>
      <gender>F</gender>
      <series>Friends</series>
   </actor>

   <actor>
      <id>2</id>
      <name>Sheldon Cooper</name>
      <gender>M</gender>
      <series>BBT</series>
   </actor>
</actors>

Unstructured Data

  1. Text Logs: Server logs, application logs.

  2. Social Media Posts: Tweets, Facebook comments.

  3. Emails: Customer support interactions.

  4. Audio/Video: Customer call recordings and marketing videos.

  5. Customer Reviews: Free-form text reviews.

  6. Images: Product images user profile pictures.

  7. Documents: PDFs, Word files.

  8. Sensor Data: IoT data streams.

These can be ingested into modern data warehouses for analytics, often after some preprocessing. For instance, text can be analyzed with NLP before storing, or images can be processed into feature vectors.

Last updated