<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Home on Eclipse Open Datasets</title>
    <link>/</link>
    <description>Recent content in Home on Eclipse Open Datasets</description>
    <generator>Hugo -- gohugo.io</generator>
    <atom:link href="/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>About</title>
      <link>/about/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/about/</guid>
      <description>The Eclipse Foundation provides individuals and organizations with a commercially focused environment for open source software innovation. It includes git repositories, reviews, issue management, continuous integration, forums, and mailing lists, among other services. Many well-known and widely used projects are hosted on the forge, including the Eclipse IDE itself, several projects about IoT, modeling, and the new Java working group.
Crossminer &amp;amp; Scava: Crossminer is an EU-funded research project that aims at providing tailored recommendations for software practitioners.</description>
    </item>
    
    <item>
      <title>Datasets Privacy</title>
      <link>/privacy/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/privacy/</guid>
      <description>Introduction: This document presents the datasets generated for Scava, discusses their privacy implications, and describes what has been done to ensure the data is safe.
All datasets are anonymised: fields that could be used to identify individuals or companies either directly or indirectly have been transformed using the Anonymise::Utility Perl module.
The intended audience of the datasets is composed of:
 Research laboratories, mainly in the field of software engineering.</description>
    </item>
    
    <item>
      <title>ecd.che</title>
      <link>/projects/ecd.che/datasets_report/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/projects/ecd.che/datasets_report/</guid>
      <description>About this document: This document is an R notebook, dynamically created from the numbers extracted from the project. It lists all datasets published for the project, providing basic numbers, figures and a quick summary, and serves as a test case to make sure that all the required data is present and roughly consistent with requirements. All plots and tables are computed from the actual data as provided in the downloads.</description>
    </item>
    
    <item>
      <title>ee4j.glassfish</title>
      <link>/projects/ee4j.glassfish/datasets_report/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/projects/ee4j.glassfish/datasets_report/</guid>
      <description>About this document: This document is an R notebook, dynamically created from the numbers extracted from the project. It lists all datasets published for the project, providing basic numbers, figures and a quick summary, and serves as a test case to make sure that all the required data is present and roughly consistent with requirements.</description>
    </item>
    
    <item>
      <title>List of Eclipse Projects</title>
      <link>/projects/eclipse_projects/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/projects/eclipse_projects/</guid>
      <description>This is the list of all Eclipse project datasets published for Eclipse Scava.
Eclipse APP4MC  Analysis report: dataset_report_technology.app4mc.html PMI home: https://projects.eclipse.org/projects/technology.app4mc Downloads:  bugzilla_components.csv.gz bugzilla_evol.csv.gz bugzilla_issues.csv.gz bugzilla_issues_open.csv.gz bugzilla_versions.csv.gz eclipse_forums_posts.csv.gz eclipse_forums_threads.csv.gz eclipse_pmi_checks.csv.gz eclipse_pmi_checks.json.gz git_commits.csv.gz git_commits_evol.csv.gz git_log.txt.gz jenkins_builds.csv.gz jenkins_jobs.csv.gz scancode_authors.csv.gz scancode_copyrights.csv.gz scancode_files.csv.gz scancode_holders.csv.gz scancode_licences.csv.gz scancode_packages.csv.gz scancode_programming_languages.csv.gz scancode_special_files.csv.gz    Eclipse Acceleo  Analysis report: dataset_report_modeling.m2t.acceleo.html PMI home: https://projects.eclipse.org/projects/modeling.m2t.acceleo Downloads:  bugzilla_components.csv.gz bugzilla_evol.csv.gz bugzilla_issues.csv.gz bugzilla_issues_open.csv.gz bugzilla_versions.csv.gz eclipse_forums_posts.csv.gz eclipse_forums_threads.csv.gz eclipse_pmi_checks.csv.gz eclipse_pmi_checks.json.gz git_commits.csv.gz git_commits_evol.</description>
    </item>
    
    <item>
      <title>Mbox Analysis</title>
      <link>/eclipse_mls/mbox_csv_analysis/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/eclipse_mls/mbox_csv_analysis/</guid>
      <description>About this dataset: This dataset is a dump of all posts sent to all mailing lists hosted at the Eclipse Forge. Although this is public data (the mailing lists can be browsed on the official mailman page), all data has been anonymised to prevent any misuse. The privacy issues identified, along with the anonymisation process, have been covered in a dedicated document.</description>
    </item>
    
    <item>
      <title>modeling.emf-parsley</title>
      <link>/projects/modeling.emf-parsley/datasets_report/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/projects/modeling.emf-parsley/datasets_report/</guid>
      <description>About this document: This document is an R notebook, dynamically created from the numbers extracted from the project. It lists all datasets published for the project, providing basic numbers, figures and a quick summary, and serves as a test case to make sure that all the required data is present and roughly consistent with requirements.</description>
    </item>
    
    <item>
      <title>modeling.emfcompare</title>
      <link>/projects/modeling.emfcompare/datasets_report/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/projects/modeling.emfcompare/datasets_report/</guid>
      <description>About this document: This document is an R notebook, dynamically created from the numbers extracted from the project. It lists all datasets published for the project, providing basic numbers, figures and a quick summary, and serves as a test case to make sure that all the required data is present and roughly consistent with requirements.</description>
    </item>
    
    <item>
      <title>modeling.epsilon</title>
      <link>/projects/modeling.epsilon/datasets_report/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/projects/modeling.epsilon/datasets_report/</guid>
      <description>About this document: This document is an R notebook, dynamically created from the numbers extracted from the project. It lists all datasets published for the project, providing basic numbers, figures and a quick summary, and serves as a test case to make sure that all the required data is present and roughly consistent with requirements.</description>
    </item>
    
    <item>
      <title>modeling.gendoc</title>
      <link>/projects/modeling.gendoc/datasets_report/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/projects/modeling.gendoc/datasets_report/</guid>
      <description>About this document: This document is an R notebook, dynamically created from the numbers extracted from the project. It lists all datasets published for the project, providing basic numbers, figures and a quick summary, and serves as a test case to make sure that all the required data is present and roughly consistent with requirements.</description>
    </item>
    
    <item>
      <title>modeling.m2t.acceleo</title>
      <link>/projects/modeling.m2t.acceleo/datasets_report/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/projects/modeling.m2t.acceleo/datasets_report/</guid>
      <description>About this document: This document is an R notebook, dynamically created from the numbers extracted from the project. It lists all datasets published for the project, providing basic numbers, figures and a quick summary, and serves as a test case to make sure that all the required data is present and roughly consistent with requirements.</description>
    </item>
    
    <item>
      <title>modeling.mdt.ocl</title>
      <link>/projects/modeling.mdt.ocl/datasets_report/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/projects/modeling.mdt.ocl/datasets_report/</guid>
      <description>About this document: This document is an R notebook, dynamically created from the numbers extracted from the project. It lists all datasets published for the project, providing basic numbers, figures and a quick summary, and serves as a test case to make sure that all the required data is present and roughly consistent with requirements.</description>
    </item>
    
    <item>
      <title>modeling.sirius</title>
      <link>/projects/modeling.sirius/datasets_report/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/projects/modeling.sirius/datasets_report/</guid>
      <description>About this document: This document is an R notebook, dynamically created from the numbers extracted from the project. It lists all datasets published for the project, providing basic numbers, figures and a quick summary, and serves as a test case to make sure that all the required data is present and roughly consistent with requirements.</description>
    </item>
    
    <item>
      <title>Search</title>
      <link>/search/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/search/</guid>
      <description></description>
    </item>
    
    <item>
      <title>technology.apogy</title>
      <link>/projects/technology.apogy/datasets_report/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/projects/technology.apogy/datasets_report/</guid>
      <description>About this document: This document is an R notebook, dynamically created from the numbers extracted from the project. It lists all datasets published for the project, providing basic numbers, figures and a quick summary, and serves as a test case to make sure that all the required data is present and roughly consistent with requirements.</description>
    </item>
    
    <item>
      <title>technology.app4mc</title>
      <link>/projects/technology.app4mc/datasets_report/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/projects/technology.app4mc/datasets_report/</guid>
      <description>About this document: This document is an R notebook, dynamically created from the numbers extracted from the project. It lists all datasets published for the project, providing basic numbers, figures and a quick summary, and serves as a test case to make sure that all the required data is present and roughly consistent with requirements.</description>
    </item>
    
    <item>
      <title>technology.collections</title>
      <link>/projects/technology.collections/datasets_report/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/projects/technology.collections/datasets_report/</guid>
      <description>About this document: This document is an R notebook, dynamically created from the numbers extracted from the project. It lists all datasets published for the project, providing basic numbers, figures and a quick summary, and serves as a test case to make sure that all the required data is present and roughly consistent with requirements. All plots and tables are computed from the actual data as provided in the downloads.</description>
    </item>
    
    <item>
      <title>technology.ease</title>
      <link>/projects/technology.ease/datasets_report/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/projects/technology.ease/datasets_report/</guid>
      <description>About this document: This document is an R notebook, dynamically created from the numbers extracted from the project. It lists all datasets published for the project, providing basic numbers, figures and a quick summary, and serves as a test case to make sure that all the required data is present and roughly consistent with requirements.</description>
    </item>
    
    <item>
      <title>technology.egit</title>
      <link>/projects/technology.egit/datasets_report/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/projects/technology.egit/datasets_report/</guid>
      <description>About this document: This document is an R notebook, dynamically created from the numbers extracted from the project. It lists all datasets published for the project, providing basic numbers, figures and a quick summary, and serves as a test case to make sure that all the required data is present and roughly consistent with requirements.</description>
    </item>
    
    <item>
      <title>technology.epf</title>
      <link>/projects/technology.epf/datasets_report/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/projects/technology.epf/datasets_report/</guid>
      <description>About this document: This document is an R notebook, dynamically created from the numbers extracted from the project. It lists all datasets published for the project, providing basic numbers, figures and a quick summary, and serves as a test case to make sure that all the required data is present and roughly consistent with requirements.</description>
    </item>
    
    <item>
      <title>technology.jgit</title>
      <link>/projects/technology.jgit/datasets_report/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/projects/technology.jgit/datasets_report/</guid>
      <description>About this document: This document is an R notebook, dynamically created from the numbers extracted from the project. It lists all datasets published for the project, providing basic numbers, figures and a quick summary, and serves as a test case to make sure that all the required data is present and roughly consistent with requirements.</description>
    </item>
    
    <item>
      <title>technology.paho</title>
      <link>/projects/technology.paho/datasets_report/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/projects/technology.paho/datasets_report/</guid>
      <description>About this document: This document is an R notebook, dynamically created from the numbers extracted from the project. It lists all datasets published for the project, providing basic numbers, figures and a quick summary, and serves as a test case to make sure that all the required data is present and roughly consistent with requirements.</description>
    </item>
    
    <item>
      <title>technology.scout</title>
      <link>/projects/technology.scout/datasets_report/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/projects/technology.scout/datasets_report/</guid>
      <description>About this document: This document is an R notebook, dynamically created from the numbers extracted from the project. It lists all datasets published for the project, providing basic numbers, figures and a quick summary, and serves as a test case to make sure that all the required data is present and roughly consistent with requirements.</description>
    </item>
    
    <item>
      <title>technology.tycho</title>
      <link>/projects/technology.tycho/datasets_report/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/projects/technology.tycho/datasets_report/</guid>
      <description>About this document: This document is an R notebook, dynamically created from the numbers extracted from the project. It lists all datasets published for the project, providing basic numbers, figures and a quick summary, and serves as a test case to make sure that all the required data is present and roughly consistent with requirements.</description>
    </item>
    
    <item>
      <title>tools.cdt</title>
      <link>/projects/tools.cdt/datasets_report/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/projects/tools.cdt/datasets_report/</guid>
      <description>About this document: This document is an R notebook, dynamically created from the numbers extracted from the project. It lists all datasets published for the project, providing basic numbers, figures and a quick summary, and serves as a test case to make sure that all the required data is present and roughly consistent with requirements.</description>
    </item>
    
    <item>
      <title>tools.tracecompass</title>
      <link>/projects/tools.tracecompass/datasets_report/</link>
      <pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate>
      
      <guid>/projects/tools.tracecompass/datasets_report/</guid>
      <description>About this document: This document is an R notebook, dynamically created from the numbers extracted from the project. It lists all datasets published for the project, providing basic numbers, figures and a quick summary, and serves as a test case to make sure that all the required data is present and roughly consistent with requirements.</description>
    </item>
    
  </channel>
</rss>
