Communities

Writing
Writing
Codidact Meta
Codidact Meta
The Great Outdoors
The Great Outdoors
Photography & Video
Photography & Video
Scientific Speculation
Scientific Speculation
Cooking
Cooking
Electrical Engineering
Electrical Engineering
Judaism
Judaism
Languages & Linguistics
Languages & Linguistics
Software Development
Software Development
Mathematics
Mathematics
Christianity
Christianity
Code Golf
Code Golf
Music
Music
Physics
Physics
Linux Systems
Linux Systems
Power Users
Power Users
Tabletop RPGs
Tabletop RPGs
Community Proposals
Community Proposals
tag:snake search within a tag
answers:0 unanswered questions
user:xxxx search by author id
score:0.5 posts with 0.5+ score
"snake oil" exact phrase
votes:4 posts with 4+ votes
created:<1w created < 1 week ago
post_type:xxxx type of post
Search help
Notifications
Mark all as read See all your notifications »
Meta

Welcome to the Judaism community on Codidact!

Will you help us build our community of learners? Drop into our study hall, ask questions, help others with answers to their questions, share a d'var torah if you're so inclined, invite your friends, and join us in building this community together. Not an ask-the-rabbi service, just people at all levels learning together.

Comments on What content do we want to import from Mi Yodeya?

Parent

What content do we want to import from Mi Yodeya?

+4
−0

We can import questions and answers from Mi Yodeya.1 You might have noticed that we imported two already, the ones needed by our "not professional advice" notice. What else to import is up to the community.

The earliest Codidact communities (Writing and Outdoors) did bulk imports, excluding only closed questions. This meant that all the content was in one place, but it also made a very large initial pile to curate. Curate, you ask? Well, we can't know who voted how on Mi Yodeya and anyway this is a new site with potentially a new community, so our policy thus far has been to reset votes on import. That means everything starts at zero and you can vote, confident that you aren't double-voting. But seeing a site full of "0" isn't ideal either.

Speaking personally, and not as a Codidact administrator, I now recommend a more intentional and phased approach to data import. That doesn't mean we can't get most or all of it if that's what we want, but we should think about what we want before asking for it.

Here are some things to know about data import, to inform this discussion:

  • Data import is scripted but requires developer intervention too; it's not "fire and forget". We would therefore like to batch import requests, accumulating a small list rather than doing posts one at a time. This might mean a delay of a few days between a request and its fulfillment.

  • As I've implied, but just to make it explicit, we don't have to do it all at once. We can do multiple imports over time.

  • We can import anything that can be expressed in a SQL query. If you can get it using the Stack Exchange Data Explorer, we should be able to get it too. This means we could restrict imported posts by tag, by score, by status (for questions), by how many answers a question has, and more.

  • We can import specific posts (like the two we started with). If there are specific posts we want, compile a list of links.

  • We can combine imports with categories. For example, if we decide to create a category for Purim Torah and we want to import some PTIJ questions from Mi Yodeya, we can make them all end up in that category instead of Q&A ("main").

How would we like to approach data import?

Update: The question of general imports is still open, but there is now a place to request import of specific questions.

  1. The Creative Commons license permits this so long as we attribute and link the source. You can see an example of how we're doing this on our Writing site (see the notice at the bottom of the post). Note that we drop this attribution for people who create accounts here and link them to their SE accounts, because those people have now directly licensed that content to us, in addition to other licenses they've granted. For example, this question was imported, but it's mine and I have an account here, so there's no attribution notice.

History
Why does this post require attention from curators or moderators?
You might want to add some details to your flag.
Why should this post be closed?

1 comment thread

General comments (12 comments)
Post
+4
−0

Regardless of what general criteria we come up with for importing data, we should accept requests for specific questions to import. I re-asked a couple of my unanswered questions here, because we didn't have data import, but that's not ideal (and will create duplicates if we ever do import the originals). Further, this do-it-yourself approach only works for your own content unless you invest a lot of effort to rewrite someone else's question in your own words. If you want to answer a question here but it was asked there, you can't do that easily.

Let's create a meta post where people can comment with links to questions they'd like to import, and once a week or so the developers can import those (or if there are a lot, a manageable subset with the rest queued) and delete the corresponding comments.

By doing smaller batches at first, we can give the new imports more attention (tagging in particular will probably need some cleanup) and also tune the process. If we start by importing with votes, we can also see if that's actually destabalizing or if that's not a concern after all.

History
Why does this post require attention from curators or moderators?
You might want to add some details to your flag.

1 comment thread

General comments (4 comments)
General comments
Dani‭ wrote over 4 years ago · edited over 3 years ago

Maybe a whole new category to make it more organized?

Monica Cellio‭ wrote over 4 years ago

@Dani I don't think we need a category or even one post per request -- a single list that everyone edits would be ideal, or, worst case, one list per requester. I want to make this easy on the developer who's going to say "give me the next 50" or whatever. I think we can do this on meta.

AA ‭ wrote over 4 years ago

I think the easiest way to do this is a dedicated meta question, where suggested imports are placed as comments on the post. Then any developer can stop by and just delete any given comment as it's processed. No excessive bumping of the post.

Monica Cellio‭ wrote over 4 years ago

@AA that's a good idea, especially about not bumping.