Communities

Writing
Writing
Codidact Meta
Codidact Meta
The Great Outdoors
The Great Outdoors
Photography & Video
Photography & Video
Scientific Speculation
Scientific Speculation
Cooking
Cooking
Electrical Engineering
Electrical Engineering
Judaism
Judaism
Languages & Linguistics
Languages & Linguistics
Software Development
Software Development
Mathematics
Mathematics
Christianity
Christianity
Code Golf
Code Golf
Music
Music
Physics
Physics
Linux Systems
Linux Systems
Power Users
Power Users
Tabletop RPGs
Tabletop RPGs
Community Proposals
Community Proposals
tag:snake search within a tag
answers:0 unanswered questions
user:xxxx search by author id
score:0.5 posts with 0.5+ score
"snake oil" exact phrase
votes:4 posts with 4+ votes
created:<1w created < 1 week ago
post_type:xxxx type of post
Search help
Notifications
Mark all as read See all your notifications »
Meta

Welcome to the Judaism community on Codidact!

Will you help us build our community of learners? Drop into our study hall, ask questions, help others with answers to their questions, share a d'var torah if you're so inclined, invite your friends, and join us in building this community together. Not an ask-the-rabbi service, just people at all levels learning together.

Comments on What content do we want to import from Mi Yodeya?

Parent

What content do we want to import from Mi Yodeya?

+4
−0

We can import questions and answers from Mi Yodeya.1 You might have noticed that we imported two already, the ones needed by our "not professional advice" notice. What else to import is up to the community.

The earliest Codidact communities (Writing and Outdoors) did bulk imports, excluding only closed questions. This meant that all the content was in one place, but it also made a very large initial pile to curate. Curate, you ask? Well, we can't know who voted how on Mi Yodeya and anyway this is a new site with potentially a new community, so our policy thus far has been to reset votes on import. That means everything starts at zero and you can vote, confident that you aren't double-voting. But seeing a site full of "0" isn't ideal either.

Speaking personally, and not as a Codidact administrator, I now recommend a more intentional and phased approach to data import. That doesn't mean we can't get most or all of it if that's what we want, but we should think about what we want before asking for it.

Here are some things to know about data import, to inform this discussion:

  • Data import is scripted but requires developer intervention too; it's not "fire and forget". We would therefore like to batch import requests, accumulating a small list rather than doing posts one at a time. This might mean a delay of a few days between a request and its fulfillment.

  • As I've implied, but just to make it explicit, we don't have to do it all at once. We can do multiple imports over time.

  • We can import anything that can be expressed in a SQL query. If you can get it using the Stack Exchange Data Explorer, we should be able to get it too. This means we could restrict imported posts by tag, by score, by status (for questions), by how many answers a question has, and more.

  • We can import specific posts (like the two we started with). If there are specific posts we want, compile a list of links.

  • We can combine imports with categories. For example, if we decide to create a category for Purim Torah and we want to import some PTIJ questions from Mi Yodeya, we can make them all end up in that category instead of Q&A ("main").

How would we like to approach data import?

Update: The question of general imports is still open, but there is now a place to request import of specific questions.

  1. The Creative Commons license permits this so long as we attribute and link the source. You can see an example of how we're doing this on our Writing site (see the notice at the bottom of the post). Note that we drop this attribution for people who create accounts here and link them to their SE accounts, because those people have now directly licensed that content to us, in addition to other licenses they've granted. For example, this question was imported, but it's mine and I have an account here, so there's no attribution notice.

History
Why does this post require attention from curators or moderators?
You might want to add some details to your flag.
Why should this post be closed?

1 comment thread

General comments (12 comments)
Post
+8
−1

I don't see why we would throw out the old vote counts. If a few people gain a few extra points on a one time basis, that's not such a big deal. The advantage gained by having good signal about post quality, more Google hits to attract people to the site, and the ability to duplicate out many basic questions seems more valuable to the community.

If the concern is one of appearances when the average new post score being smaller than the average old post score, then perhaps we can scale all old scores down by a factor of 2 (or X, with X perhaps [retroactively] changing over time as this site grows).

As a first order approximation, I'd propose bringing over any open post with an answer of score >=5 with all it's non-negatively scored components. At the same time maintain a list or location where people can submit links to specific questions to be brought over, to be processed [automatically] on a ~weekly basis.

It would also be cool to have the ability to import on demand when voting a post as a duplicate of a stackexchange post. If enough votes go through then bring over the answers immediately. Otherwise we get answers like this which just resummarize what's on Mi Yodeya, itself usually a summary of primary sources. That doesn't seem like a constructive use of people's time.

History
Why does this post require attention from curators or moderators?
You might want to add some details to your flag.

1 comment thread

General comments (3 comments)
General comments
Monica Cellio‭ wrote over 4 years ago

This makes a lot of sense to me, now that you've explained it like that.

msh210‭ wrote over 4 years ago

Upvoting for the content recommendations, not for the vote-count recommendations.

Monica Cellio‭ wrote over 4 years ago

I spun off "what to do with votes" as a separate question because of the ambiguity of voting as expressed in @msh210's comment.