Difference between revisions of "A.I. Natural Language Processing 1.0"

From Hackerspace.gr
Jump to: navigation, search
m (Updated the "what" field to be consistent with the page title)
 
(13 intermediate revisions by 5 users not shown)
Line 1: Line 1:
 
{{Event
 
{{Event
 
|logo=Natural language processing.jpg
 
|logo=Natural language processing.jpg
|what=A.I. Natural Language Processing
+
|what=A.I. Natural Language Processing 1.0
 
|tagline=Let the skynet begin
 
|tagline=Let the skynet begin
 
|eventowner=User:Skmp
 
|eventowner=User:Skmp
Line 12: Line 12:
 
|country=Greece
 
|country=Greece
 
}}
 
}}
 
 
Ευθύμης Πετρόπουλος-Τράκας "Πρώτη συνάντηση, για την Επεξεργασία Φυσικής Γλώσσας. Σας περιμένω όλους εκεί, με μπόλικη διάθεση, υλικό προς μελέτη και φυσικά ιδέες!"
 
Ευθύμης Πετρόπουλος-Τράκας "Πρώτη συνάντηση, για την Επεξεργασία Φυσικής Γλώσσας. Σας περιμένω όλους εκεί, με μπόλικη διάθεση, υλικό προς μελέτη και φυσικά ιδέες!"
  
Line 20: Line 19:
 
http://doodle.com/zrag5ak9atvzd8yd
 
http://doodle.com/zrag5ak9atvzd8yd
  
* definitions
+
= definitions =
 
** nlp - natural language processing
 
** nlp - natural language processing
 
** nlg - natural language generation
 
** nlg - natural language generation
Line 35: Line 34:
 
** Vasilis Salapatas
 
** Vasilis Salapatas
  
* referenced projects
+
 
** nusami
+
= referenced projects =
 +
** [http://www.scify.gr/site/en/our-projects/completed-projects/newsum-menu-en newsum]
 
** Standfold parser
 
** Standfold parser
** asoe
+
** asoee
*** pos-tagger (nlp.aub.gr)
+
*** pos-tagger ([http://nlp.cs.aueb.gr/ nlp.cs.aueb.gr])
 
*** Name entity recogniser (NER)
 
*** Name entity recogniser (NER)
** eclipse
+
** [http://www.eclipse.org Eclipse IDE]
*** pydev extension
+
*** [http://pydev.org pydev extension]
** simple nlg  
+
** simple nlg
** nltk
+
** [http://nltk.org NLTK] (Natural Language Toolkit) in Python
 
** wordnet
 
** wordnet
 
** ispell
 
** ispell
Line 51: Line 51:
 
** iperdiavgeia  
 
** iperdiavgeia  
 
** alibaba
 
** alibaba
 +
** Wiktionary
 +
** europarl
 +
** [http://et.gr Ethinko tipografio]
 +
** [https://podio.com podio]
 +
** oasana
 +
** wordforms
 +
** [http://duolingo.com Duolingo]
  
* Topics
+
= General chat =
 
** Introduction of each person
 
** Introduction of each person
 
** Plan is for a Study group (once per one or two weeks)
 
** Plan is for a Study group (once per one or two weeks)
Line 77: Line 84:
 
*** tries to match betwen bad/good
 
*** tries to match betwen bad/good
 
*** Implement in Greek ?
 
*** Implement in Greek ?
** Dimokritos, Openbox
+
** Dimokritos, OpenGov (useful for obtaining greek corpora?)
 
** iperdiavgeia -> search diavgeia api
 
** iperdiavgeia -> search diavgeia api
 
** provlepsi aftoktonion meso fb
 
** provlepsi aftoktonion meso fb
Line 90: Line 97:
 
*** glosiko  modelo, paragogi diafimiseon apo perigrafi, using nlg
 
*** glosiko  modelo, paragogi diafimiseon apo perigrafi, using nlg
 
*** 47k corpus is wayy too small
 
*** 47k corpus is wayy too small
 +
** scrapping
 +
*** using browser
 +
** dictionaries - parsing existing ones from the 'net
 +
*** Existing ones kind of suck
 +
** Open data in greek we can use
 +
** europarl
 +
*** hugely big corpus
 +
*** can be used to train
 +
** Ethinko tipografio
 +
*** Another open source of information
 +
** reverse dictionary
 +
** wordforms
  
 
+
= project ideas =
* project ideas
+
 
** apokodikopiisi grammikis a ~
 
** apokodikopiisi grammikis a ~
 
** basic dictionary-based nlg for greek (panagiotis)
 
** basic dictionary-based nlg for greek (panagiotis)
Line 101: Line 119:
 
** topic detection (theodim)
 
** topic detection (theodim)
 
** Corpus -- there's nothing in greek -- perhaps crawl for it ? (panagiotis)
 
** Corpus -- there's nothing in greek -- perhaps crawl for it ? (panagiotis)
**
+
 
 +
 
 +
= decisions =
 +
** doodle next meetup
 +
** investigate pontio
 +
** For next time, we'll have presentations of the tools everyone has used (more like a study group and not format presentations)
 +
** get registed on wiki !
 +
** get registed on hackerspace discuss!
 +
 
 +
= Contact =
 +
Μέσω της λίστας γενική συζήτησης βάζοντας το label [NLP] στην αρχή του subject.

Latest revision as of 10:54, 20 June 2013

Natural language processing.jpg
Starts Organizer
Sat 01 Jun 2013 15:00 Everyone interested
Ends Event Owner
Sat 01 Jun 2013 16:00 User:Skmp

Let the skynet begin



Ευθύμης Πετρόπουλος-Τράκας "Πρώτη συνάντηση, για την Επεξεργασία Φυσικής Γλώσσας. Σας περιμένω όλους εκεί, με μπόλικη διάθεση, υλικό προς μελέτη και φυσικά ιδέες!"

(coordination and note taking by skmp)


http://doodle.com/zrag5ak9atvzd8yd

definitions

    • nlp - natural language processing
    • nlg - natural language generation
  • People
    • theodim
    • skmp
    • Nikos Katzouris
    • Orestis Roditis
    • Timos Petropoulos
      • Interests ~ Cryptography (as applicable to nlp ?), Apply newer algos to greek
    • Panagiotis Katsivelis
      • Interests ~ nlg
    • Vasilis Salapatas


referenced projects

General chat

    • Introduction of each person
    • Plan is for a Study group (once per one or two weeks)
    • We'll pick a subject and work on it
      • apokodikopiisi grammikis a ~
    • Statistical vs structured algorithms
    • language discussion (it always must happen :p)
      • Standfold parser & related libraries (java-based)
    • mailing list tag decided to be [nlp]
    • language chat again
      • perhaps we can decide on a common base ?
    • nlg
      • using a dictionary
      • There's nothing for greek right now
    • nltk like, but updated w/ support for greek
    • hosting probably on github
    • back to project talk
    • ispell / spell checking
    • Sentiment analysis
      • alchemy
    • nothing in Greece really
      • Not really open/open source
    • turney algorithm
      • tries to match betwen bad/good
      • Implement in Greek ?
    • Dimokritos, OpenGov (useful for obtaining greek corpora?)
    • iperdiavgeia -> search diavgeia api
    • provlepsi aftoktonion meso fb
    • Sentiment analysis
    • topic detection is (more general than sentiment analysis)
    • Aggregation of open knowledge
    • its hard to get data
    • Scale of required corpus
      • 2M "entries" is a good number
    • alibaba
    • ptixiaki tou panagioti
      • glosiko modelo, paragogi diafimiseon apo perigrafi, using nlg
      • 47k corpus is wayy too small
    • scrapping
      • using browser
    • dictionaries - parsing existing ones from the 'net
      • Existing ones kind of suck
    • Open data in greek we can use
    • europarl
      • hugely big corpus
      • can be used to train
    • Ethinko tipografio
      • Another open source of information
    • reverse dictionary
    • wordforms

project ideas

    • apokodikopiisi grammikis a ~
    • basic dictionary-based nlg for greek (panagiotis)
    • nltk like, but updated w/ support for greek (who ?)
    • Dictionary/language graph for greek ? (theodim)
    • spell checker improvement (theodim)
    • turney-based sentiment analysis for Greek
    • topic detection (theodim)
    • Corpus -- there's nothing in greek -- perhaps crawl for it ? (panagiotis)


decisions

    • doodle next meetup
    • investigate pontio
    • For next time, we'll have presentations of the tools everyone has used (more like a study group and not format presentations)
    • get registed on wiki !
    • get registed on hackerspace discuss!

Contact

Μέσω της λίστας γενική συζήτησης βάζοντας το label [NLP] στην αρχή του subject.